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Abstract 

The recently proven Quantum Lovasz Local Lemma generalises the well-known Lovasz 
Local Lemma. It states that, if a collection of subspace constraints are "weakly dependent", 
there necessarily exists a state satisfying all constraints. It implies e.g. that certain instances 
of the fc-QSAT quantum satisfiability problem are necessarily satisfiable, or that many-body 
systems with "not too many" interactions are always frustration-free. 

However, the QLLL only asserts existence; it says nothing about how to find the state. 
Inspired by Moser's breakthrough classical results, we present a constructive version of the 
QLLL in the setting of commuting constraints, proving that a simple quantum algorithm 
converges efficiently to the required state. In fact, we provide two different proofs, one using 
a novel quantum coupling argument, the other a more explicit combinatorial analysis. Both 
proofs are independent of the QLLL. So these results also provide independent, construc- 
tive proofs of the commutative QLLL itself, but strengthen it significantly by giving an 
efficient algorithm for finding the state whose existence is asserted by the QLLL. We give 
an application of the constructive commutative QLLL to convergence of CP maps. 

We also extend these results to the non-commutative setting. However, our proof of the 
general constructive QLLL relies on a conjecture which we are only able to prove in special 
cases. 
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1 Introduction and Background 

The Lovasz Local Lemma (LLL), proven by Erdos and Lovasz [1975], is a well-known and widely- 
used result in probability theory. It states that if individual events are "not too" dependent 
on each other and occur with "not too high" a probability, then there is a positive probability 
that none of them occur. This is a non-trivial extension of the trivial fact that, if the individual 
events were completely independent, and if none of them occurred with certainty, then there 
would be a positive probability that none of them occur. The LLL gives tight bounds on just 
how dependent the events are allowed to be and how high the probability of the events can be. 
In its most general form, it states: 

Theorem 1 (Erdos and Lovasz [1975]) Let j4i, A2, . . . ,Am be events in a probability space, 
and let T{Ai) denote the set of events that are not independent of Ai, excluding Ai itself. If 
there exist values < xi, 2:2, . . . , Xm < 1 such that 

'ii:Y>T{Ai)<x,-Y[{l-x.j), (1) 

Aj6r(A,) 

then the probability that none of the events occur is at least Y\^{\- — Xi). (In particular, it is 
positive.) 



2 



Applications of the LLL abound [Alon and Spencer, 2008]. It is frequently invoked in order 
to prove existence of some mathematical object via the probabilistic method. For example, 
it can be used to prove existence of solutions to boolean satisfiability problems. The fc-SAT 
problem asks whether there exists an assignment of truth values to a set of boolean variables 
that satisfies a boolean expression in conjunctive- normal form (CNF), where each clause in the 
CNF formula involves at most k variables. Letting Ai be the event that the i^^ clause is violated 
when the assignment is chosen at random, the LLL implies: 

Corollary 2 (Symmetric Lovasz Local Lemma) If each variable in a k-CNF formula ap- 
pears in at most 2^^ /{e • k) clauses, then there exists a satisfying assignment for the formula. 

Indeed, this amounts to a symmetric, uniform version of the full Lovasz Local Lemma. 

Recently, by replacing events with subspaces, and probabilities with relative dimensions, 
Ambainis, Kempe, and Sattath [2009] succeeded in generalising the LLL to the quantum 
setting. The relative dimension R{X) of a subspace X in a vector space V is the ratio 
R{X) = dmi{X)/ dim{V). Two subspaces Xi,Xj are said to be R-independent if R{Xir] Xj) = 
R{Xi)R{Xj). In its most general form, the Quantum Lovasz Local Lemma (QLLL) states: 

Theorem 3 (Ambainis, Kempe, and Sattath [2009]) 

Let Xi,X2, ■ ■ ■ ,Xm be subspaces, and let r(Xj) denote the set of subspaces that are not R- 
independent of Xi, excluding Xi itself. If there exist values < xi,X2, ■ ■ ■ ,Xm < 1 such that 
the relative dimensions R{Xi) satisfy 

R{Xi)>l-x,-J[{l-Xj), (2) 

then R{{^-Xi) > Wi{^ — Xi). In particular, the intersection Hi^* ^^^^ positive dimension. 

(Note that, because of the properties of R-independence under orthogonal complement, the 
QLLL has to be stated "the other way around" to the LLL, so that the subspaces correspond 
to events that one does want to occur. See Ambainis, Kempe, and Sattath [2009, Lemma 11], 
and the discussion thereafter.) 

Viewed from one perspective, the QLLL has little to do with quantum physics; rather, it is 
a mathematical generalisation of the standard LLL to a geometrical setting. However, viewed 
from another perspective, the quantum version is closely related to current topics of physics 
research. 

For example, just as the LLL can be applied to /c-SAT problems, the QLLL can be applied to 
its quantum generalisation, /c-QSAT [Bravyi, 2006]. Boolean variables become qubits, clauses 
in a CNF formula become projectors Ilj that act non-trivially on k qubits, and the /c-QSAT 
problem asks whether there is a state of the qubits satisfying Vi : Ilj \ip) = 0. The QLLL 
implies: 

Corollary 4 (Ambainis, Kempe, and Sattath [2009]) 

Let Hi, 112, ■ ■ ■ )IIm be a /c-QSAT instance where all projectors have rank at most r. If each 
qubit occurs in at most 2^ /{e ■ r ■ k) projectors, then there exists a satisfying state for the 
fe-QSAT instance. 

An equivalent way of expressing the /c~QSAT problem is to ask whether the Hamiltonian 
H = ^ X^i has a zero-energy ground state. Since the Ilj are positive-semidefinite, this is 
equivalent to asking whether the ground state of the overall Hamiltonian is simultaneously the 
ground state of all the individual local terms; i.e. we are asking whether or not the ground state 
is frustrated. Replacing the projectors Hj with positive-semidefinite local Hamiltonian terms hi 
whose support (coimage) is Hj does not affect whether the ground state is frustrated. So the 
/c-QSAT problem amounts to asking whether the ground state of an interacting many-body 
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system is frustrated or not. The QLLL implies that many-body systems in which each particle 
interacts with "not too many" others are never frustrated. 

The LLL and its quantum counterpart assert existence, e.g. of a satisfying assignment. But 
they give no indication as to how to find this assignment. In a breakthrough result, Moser 
[2009] gave a beautiful proof of a constructive version of the classical LLL. This not only gives 
an independent proof of the LLL, but does so by providing an efficient algorithm for finding the 
point in probability space whose existence is asserted by the LLL (e.g. the satisfying assignment 
to a /c-SAT problem). 

Moser [2009] originally proved this for the symmetric LLL, achieving the tight asymptotic 
scaling but not quite achieving the tight constant in the bounds of Corollary 2. With Tardos, 
he quickly generalised his proof to cover the general LLL and match the tight constants of The- 
orem 1 [Moser and Tardos, 2010]. The proof imposes a very slight restriction, in requiring that 
the events in the LLL be determined by different subsets of underlying, mutually-independent, 
random variables: 

Theorem 5 (Moser and Tardos [2010]) Letpi,p2, . . . ,Pn be mutually-independent random 
variables, and let Ai,A2,... ,Am be events determined by these variables. If there exist values 
< xi, X2, ■ ■ ■ , Xm < 1 such that 



then there exists an assignment of values to the variables pi such that none of the events Ai 
occur. Moreover, there is a randomised algorithm (Algorithm 1) that finds this assignment in 
expected time 

m 

o(" + Et^-i^.i). w 

i=l 

where \Ai\ is the number of variables involved in determining event A^. 

The algorithm is almost the simplest randomised algorithm one could imagine (Algorithm 1). 
We are looking for an assignment such that none of the events occur, so we say that an event is 
violated by an assignment if it occurs for that assignment. The algorithm maintains a register 
V = viV2 ■ ■ - Vn of assignments to the variables pi. At each step, it checks whether any event 
Aj is violated by the current assignment, and if so replaces the assignments Vi to the variables 
involved in that event by values chosen uniformly at random. It repeats this procedure until no 
events are violated. A priori it is not obvious that this process will ever terminate; Theorem 5 
proves that it in fact terminates in linear expected time! 



Algorithm 1 Classical Solver 

1: 

2: 

3: 

4: 

5: 

6: 

7: 

8: 

9: 
10: 
11: 
12: 



yi:Pr{Ai) <Xi-Y[{l-Xj), 

A,&r{A) 



(3) 



function solve_lll((74i, ■ ■ ■ , ^m)) 
for all Pi do 

a random evaluation of pi 

end 

while 3Aj violated by v do 
pick a violated event Aj; 
for all Pi S Aj do 

a random evaluation of pi ; 

end 
end while 
return v; 
end function 
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2 Results 



In this paper we prove a constructive version of the commutative case of the Quantum Lovasz 
Local Lemma. As in the constructive version of the (classical) LLL, we have to impose a slight 
restriction, in requiring that the subspaces respect an underlying tensor product structure. We 
can, without loss of generality, take the underlying structure to be the state space of a set 
of qudits, analogous to the underlying random variables in Theorem 5. Each subspace in the 
QLLL is then defined on some subset of the qudits. (More precisely, it is the extension of such 
a subspace to the full Hilbert space of all the qudits.) 

It will be convenient in the constructive QLLL to represent subspaces by projectors. We 
define the relative dimension of a projector 11 to simply be the relative dimension of the subspace 
X onto which it projects: R{Il) := rank(n)/dim(y) = dim(X)/ dim(y) = R{X). With the 
restriction to an underlying tensor-product space, two subspaces are i?-independent iff their 
corresponding projectors act non-trivially on at least one qudit in common; we say that the 
projectors intersect in this case. Conversely, two projectors that act non-trivially on disjoint 
subsets of qudits are said to be disjoint. We can simplify the notation by letting [i] denote 
the subset of qudits on which projector Ilj acts non-trivially; then Ilj and Ilj intersect iff 
W ^ [j] 7^ ^- We write r(nj) for the set of projectors that intersect with Ilj, excluding Ilj itself, 
and r+(nj) = r(nj) U {n,} for the set that includes Ilj. 

Definition 6 (Lovasz conditions) Let Hi, 112, ... , be projectors that act on subsets of n 
qudits. We say that the set of projectors {Ilj} satisfies the Lovasz conditions if there exist values 
< xi, X2, . . . , Xm < 1 such that 



We prove the following constructive version of the QLLL (Theorem 3) in the case of com- 
muting projectors: 

Theorem 7 (Constructive Commutative QLLL) Lei Hi, 112, • • • be mutually commut- 
ing projectors acting on subsets of n qudits. If {Ilj} satisfy the Lovasz conditions, then there 
exists a joint state p of the qudits such that \/i : Trpjp] = 0. 

Moreover, there is a quantum algorithm that converges to a state p' such that Tr[njp'] < 
£ (or, equivalently, such that Ti[Pqp] > 1 — e when Pq is the projector onto the subspace 
span{|'i/') : Villj |^) = 0}) in time 



where \ [i] \ is the number of qudits on which the projector Ilj acts non-trivially. 

(Note that, because we restrict to subspaces with a tensor product structure, there is no longer 
any need to state the constructive QLLL "the other way around" as there was in Theorem 3.) 
The algorithm is a generalisation of Algorithm 1 to the quantum setting, and is described in 
Section 3. 

There are two significant challenges in generalising Moser's constructive LLL to the quantum 
setting. First, the state we are trying to construct may be highly entangled, whereas the 
algorithm only has access to measurements of the local projectors Ilj. Another way of expressing 
this in terms of the Hamiltonian H defining a fe-QSAT instance is that, in the classical setting, 
we know in advance in which basis the overall Hamiltonian H is diagonal — the computational 
basis — and the local projectors Ilj are local in this same basis. In the quantum setting, the basis 
which diagonalises the overall Hamiltonian is only defined globally, and can be highly entangled. 



^(Hj) < x, 



(5) 



n,er(n,) 




(6) 
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The projectors Ilj will not in general be local in this basis, but will act non-trivially on the 
entire system. Were we to naively apply the Moser and Tardos [2010] algorithm in this diagonal 
basis, every time we measured a projector Ilj to be "violated" (i.e. we obtained the Ilj outcome 
upon performing the {Ilj, 1 — Ilj} measurement), we would have to discard the entire state and 
start from scratch. Thus a naive application of the Moser and Tardos [2010] algorithm to the 
quantum case reduces to picking a random state, checking if it satisfies a constraint, and, if 
not, discarding the entire state and trying again. This certainly cannot find the correct state 
efficiently. 

Note that this challenge remains just as problematic even if the projectors Ilj commute. In 
the commutative case of the QLLL, we may know a priori that there exists a basis which di- 
agonalizes all projectors simultaneously, but this basis is still only defined globally, the ground 
state can still be highly entangled, and the projectors can still be non-local in the diagonal 
basis. (Stabiliser states are a simple example of commuting Hamiltonians with highly entangled 
ground states [Nielsen and Chuang, 2000].) Indeed, this and related questions in the commu- 
tative setting have recently been gaining increasing attention in the context of Hamiltonian 
complexity [Aharonov and Eldar, 2011, Bravyi and Vyalyi, 2003, Schuch, 2011]. 

The second challenge comes from non-commutativity: quantum states are disturbed by 
measurement. The classical algorithm is free to check which fc-SAT clauses are currently 
satisfied, without affecting the current variable assignment. But quantum mechanically, even if 
we measure a /c-QSAT projector Ilj to be "satisfied" (i.e. we obtain the desired 1 — Ilj outcome 
upon performing the {Ilj, 1 — Ilj} measurement), the measurement can disturb the state so as 
to increase the probability of measing another Ilj to be violated. 

Here, we address and give a complete solution to the first of these two challenges: we prove 
a constructive version of the commutative QLLL (i.e. the case in which all the Ilj commute). 
A priori, it is not at all clear that Moser's proof extends even to the commutative quantum 
case, for the reasons discussed above. Nonetheless, by extending the proof techniques of Moser 
[2009] and Moser and Tardos [2010] in a more subtle way, we are able to prove a constructive 
version of the commutative QLLL. Moreover, the quantum algorithm involved in Theorem 7 is 
just the natural quantum generalisation of Moser and Tardos [2010], and almost the simplest 
imaginable (see Algorithm 2). It also coincides with the natural dissipative state preparation 
algorithm studied in Verstraete et al. [2009]. 

Whilst the algorithm is straightforward, its analysis is not. We provide two different proofs 
of the constructive commutative QLLL, using two very different approaches. The first proof, 
described in the main text, generalises the probabilistic approach of Moser and Tardos [2010]. 
The key step in the proof is the replacement of the classical coupling argument used by Moser 
and Tardos [2010], with a quantum coupling argument, which uses a coupling by entanglement. 
To our knowledge, this is the first example of a quantum coupling argument, used as a proof 
technique to establish convergence of a quantum stochastic process, which may be of indepen- 
dent interest. Even though the entanglement is not used directly as a resource by the algorithm, 
it is the unique properties of entanglement that allow us to prove via the quantum coupling 
that the algorithm can find the correct global basis even though it has access only to local 
measurements. 

Coupling arguments are long established as a very powerful proof technique in probability 
theory, often providing the simplest or even the only proof of many results [Lindvall, 2002, 
Thorisson, 2000]. Our quantum generalisation of the coupling method is no exception, providing 
an elegant and concise proof of the constructive commutative QLLL of Theorem 7. But coupling 
arguments often seem a little like "black magic". In our second proof, desribed in Section 4, 
we replace the coupling argument with a combinatorial proof. Whilst (as is often the case) 
the combinatorial argument is significantly more involved than the coupling argument, it is 
nonetheless more explicit. It demonstrates how the algorithm can be understood as a quantum 
stochastic process produced by iterated measurement, and leads to interesting new results on 
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such iterated measurement processes. 

To generalise these results to the general non-commutative QLLL requires that we ad- 
dress the second challenge: non-commutativity of quantum measurement, and the concomitant 
measurement-disturbance issue. The difficulty here is that "satisfied" measurements now play 
a significant role, and can sometimes make things worse instead of better for later measure- 
ments. Even the order in which the "satisfied" outcomes occur is now significant. We give 
a simple non-commutative counter-example which already violates the crucial bounds that we 
prove in the commuting case by both coupling and combinatorial arguments. This suggests that 
sharper techniques will be required to address this second challenge, and extend our results to 
the general, non-commutative setting. 

Nonetheless, although we are not able to give a complete proof of a constructive QLLL in 
the non- commutative setting, we can prove it if we assume a technical conjecture. We prove the 
conjecture in certain simple cases, and it is also supported in more general cases by numerical 
evidence (though the numerics we have done are limited). Furthermore, it is likely that the 
conjecture must hold if the natural quantum generalisation of Moser's classical algorithm is to 
work. If our conjecture is false, either there is no efficient constructive version of the general 
QLLL, or an entirely different approach is needed in the non-commutative setting. 

The paper is organised as follows. In Section 3 we briefly review the classical proof of Moser 
and Tardos [2010], then prove the constructive commutative Quantum Lovasz Local Lemma 
using a novel quantum coupling argument. Section 4 contains an alternative, combinatorial 
proof of this result. In Section 5 we apply the results of the previous section to bound the 
convergence time of certain classes of CP maps. In Section 6 we generalise the results to the full 
non-commuting setting, though we have to assume a technical conjecture which we are currently 
unable to prove except in some simple special cases. In Section 7 we conclude with an outline 
of applications of our results to physics and quantum algorithms, some possible generalisations 
in the commutative setting, and a discussion of open problems and potential directions in the 
non-commutative setting. 

3 A Constructive Proof of the Commutative Quantum Lovasz 
Local Lemma 

In order to build a constructive proof of the QLLL, we start by adapting the elegant argument 
of Moser and Tardos [2010] so that it applies to the commutative quantum case. Our first 
step is to generalise the Moser- Tardos algorithm in the obvious way. The quantum algorithm 
acts on an assignment register of n qudits, holding a state (or assignment) \a). (We abuse 
notation slightly by using |q) to denote both the register itself and the state of the register, 
even though that state need not be pure.) If the measurement {Ilj, 1 — Ilj} is performed on 
an assignment and yields the outcome Ilj, then we say that projector Ilj was violated. We 
denote the set of qudits on which a projector Ilj acts non-trivially by [i]. In another abuse of 
notation, the i^^ qudit of the assignment register will be denoted [aj), even though the reduced 
state of that qudit need not be pure. The algorithm will also require a uniform random source 
P = ({|0) , |1) , . . . , [d — l)},Pr(|i)) = 1/d) that emits qudits in a random computational basis 
state. We use the notation [oj) P to indicate replacing the contents of the quantum register 
I a,) with a fresh sample from the random source P. 

Our task is, firstly, to show that Algorithm 2 converges to a state satisfying the require- 
ments of Theorem 7, thereby proving that such a state exists, and, secondly, to prove that this 
convergence occurs in polynomial time. 

Before adapting the Moser and Tardos [2010] proof to the quantum case, let us review the 
high-level structure of their argument. Note that, as they proceed. Algorithms 1 and 2 keep a 
log (Lines 6 and 9, respectively) of which events or projectors were violated. For each entry in 
the log, Moser and Tardos [2010] imagine constructing a "witness tree" from all the log data up 
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Algorithm 2 Quantum Solver 



function solve_qlll((ni, 112, . . . , Hm)) 
for all i do 

end 
loop 

pick a projector 11 j uniformly at random; 
measure the projector Ilj; 
if the projector was violated then 
append Ilj to the execution log; 
for all j G [i] do 

\aj) ^ P 
end 
end if 
end loop 
end function 



to that point. (We will describe the precise procedure for constructing the trees later, but the 
details are not important for the overall structure of the argument.) Each violation adds one 
entry to the log, and each log entry is associated with a different witness tree, so the number 
of distinct witness trees that can be constructed from — or occur in — the log gives the number 
of violations seen by the algorithm. 

Therefore, in order to compute the expected number of violations seen by the algorithm, we 
would like to compute the probability that a particular witness tree can occur in the algorithm's 
log. Of course, it is not at all clear how to compute this probability directly. So Moser and 
Tardos [2010] use a coupling argument to relate the behaviour of the random process generated 
by the algorithm to a much simpler random process, whose behaviour is more easily analysed. 
They call this simple random process "r-check", because it executes a random process on a 
witness tree r, and outputs either "pass" or "fail" at the end. (Again, we will describe the 
r-check procedure later, but the details are not important for now.) 

Viewed separately, we have two different random processes: the algorithm, whose behaviour 
is difficult to analyse, and r-check, whose behaviour is more easily computed. The coupling 
is established by proving that, if these two random processes are fed the same random source, 
then their behaviour becomes correlated in such a way that if a witness tree r occurs in the 
algorithm's log, then r-check will always output "pass" , i.e. 

Pr(r-check passes | r occurs in the log) = 1. (7) 

A simple application of Bayes' theorem then implies that 

^ , ■ ^ ^ \ Pr(r occurs in the log I r-check passes) ^ , , , n /o \ 

rr(r occurs m the log) = — — j — - — - • Fr(r-check passes) (8a) 

Pr(r-check passes | r occurs in the log) 

< Pr (r-check passes), (8b) 

(using Eq. (7) and the fact that the numerator, being a probability, is upper-bounded by 1). 
Thus we can bound the probability of a particular witness tree occurring in the algorithm's 
log (which is the same as the probability of a particular violation occurring) by the probability 
that r-check passes. In order to bound the latter, Moser and Tardos [2010] relate the r-check 
procedure to a Galton- Watson branching process,^ which can be analysed straightforwardly. 



^The Galton- Watson process is a standard random process in probability theory, originally derived in order 
to model extinction of aristocratic surnames in family trees, something of great concern to the Victorians in 19*'' 
century England. 
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3.1 Witness Trees 



Imagine constructing a tree of violated projectors from the data in the log, at any point during 
the algorithm, exactly as in Moser and Tardos [2010, Section 2]. Denote projectors appearing 
in the log by H^*), for i = . . .1 (where I is the number of entries in the log). To construct 
a tree starting from entry n in the log, we work backwards from projector n^"\ First, create 
a root node labelled by H^"). For each preceding projector H^'^"') in the log, create a new 
vertex, labelling it by n(*\ and attach it below an existing vertex labelled by a projector which 
intersects with H^*^ . If there is more than one such vertex, attach it below the one furthest from 
the root (breaking ties arbitrarily). If there is no such vertex, simply discard the new vertex. 

Note two key observations about the trees constructed by this procedure. Firstly, projectors 
located at the same level in the tree can never intersect. Secondly, if a vertex labelled by II^*) 
is located at a higher level of the tree (closer to the root) than a vertex labelled by II'^-'^ and 
if projectors II^*) and II^-^) intersect, then II^*) must have occurred later^ in the log than II^-'^ 
i.e. i > j. Together, these properties imply that the tree encodes an ordering of the violated 
projectors with respect to the partial order defined by projector intersections. 

A tree is called proper if distinct children of the same vertex always receive distinct labels, 
and we say that a tree r occurs in the log if r can be generated by constructing a tree starting 
from some entry of the log. We now show that trees occurring in the log of Algorithm 2 still 
satisfy Lemma 2.1 from Moser and Tardos [2010], reinterpreting it for the quantum case in the 
obvious way: 

Lemma 8 Let t be a fixed tree and L the (random) log produced by Algorithm 2. 
(i). If T occurs in L, then r is proper. 

(a). The probability that r occurs in L is at most Yly^T-P^[^{v)], 

where Il{v) is the projector labelling vertex v, and Pr[n(f)] is the probability of measuring n(f) 
on the maximally mixed state. 

Proof Part (i) follows immediately from the above observations about the way the trees are 
constructed. (Indeed, there is nothing quantum about part (i), so the statement and proof are 
identical to the corresponding part of Lemma 2.1 in Moser and Tardos [2010].) 

To prove part (ii), we introduce a new technique, a quantum coupling argument, to establish 
a coupling between Algorithm 2 and Algorithm 3. Algorithm 3 is a straightforward quantum 
version of the classical r-check procedure. In the classical case the two processes are coupled by 
a joint random source. To establish the commutative quantum case we replace the joint random 
source of qudits of Algorithm 2 and Algorithm 3 by a source of maximally entangled qudit pairs. 
One half of the pair is provided to the each of the two coupled quantum processes. Since tracing 
out one half of each pair results in the maximally mixed state on the other half, the coupling 
is undetectable by the process acting on the remaining subsystem: the marginal distributions 
of the two coupled random processes are identical to the original, separate random processes 
Algorithm 2 and Algorithm 3. Nevertheless, entanglement across the subsystem boundary can 
be exploited to bound joint event probabilities, just as correlations are exploited by a classical 
coupling argument. More formally, we proceed by proving the following lemmas, which together 
imply part (ii). 

Lemma 9 Vi{T-check passes) = Oijgr ^^^re Pr[n(z;)] is the probability of measuring 

n(t;) on the maximally mixed state. 

Proof This is trivial, since r-check independently measures each projector n(u) on the maxi- 
mally mixed state drawn from the uniform random sources Qi. (Indeed, this is true independent 
of the order in which r-check visits the vertices.) □ 

Remember that the tree is constructed by reading the log backwards. 
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Algorithm 3 Quantum r-check 



function r-check(r) 

sort nodes of r in reverse breadth-first order; 
for all v G T do 
for all X G [i] do 

end 

measure 11^ on 

if the measurement was satisfied (gave outcome 1 — Ilj) then 

return "failed" 
end if 
end 

return "passed" 
end function 



The next lemma establishes the quantum generalisation of Eq. (7) in the classical argument. 



Lemma 10 Couple the random sources Pi of Algorithm 2 and Qi of Algorithm 3, so that the 
nth qudit from Pi is maximally entangled with the nth qudit from Qi. Then, if {Hi} commute, 
^^{t -check passes\T occurs) = 1. 

Proof First, notice that if r occurs in Algorithm 2's execution log, then each label n(f ) in 
r corresponds to a unique violation n(f ) in the log. Furthermore, the partial ordering with 
respect to projector intersections of that section of the log that generates r, is precisely the 
partial order defined by r itself. Therefore, by visiting the vertices in reverse-breadth-first 
order, r-check will measure the projectors n(f in the same order as Algorithm 2 measured 
n(f), up to reorderings of projectors that act on disjoint sets of qudits. Thus, when r-check 
draws random qudits on which order to measure Ii{vY, it dr aws exactly the qudits from Qi 
that started off maximally entangled with those on which Algorithm 2 measured Il{v)} 

It is helpful to picture each random source Pi,Qi as a semi- infinite stack of qudits (each 
being one half of a maximally entangled pair between Pi and Qi). When a fresh qudit is drawn, 
the random source takes it from the bottom of it stack. The nth qudit in the Pi stack is then 
in one-to-one correspondence with the nth qudit in the Qi stack, with which it is maximally 
entangled. Also, when Algorithm 2 replaces the qudits on which it has just measured a violation 
Ilj with fresh ones from the Pi stacks (Line 11), we can equivalently think of this as appending 
the fresh qudits to the register \a), and redefining all projectors that act on qudits [i] to now act 
on these new qudits instead. For a given state of register \a), let |/3) denote the corresponding 
qudits from the Qi stacks. (As with \a) this is a slight abuse of notation, as the reduced state 
of the qudits in |/3) will not necessarily be pure.) 

In this picture, let Pt denote the projector on the appropriate qudits on la) corresponding 
to the tth measurement outcome of Algorithm 2, which could be either a violation Ilj or a 
satisfied measurement 1 — Ilj. (Since we are now appending qudits from Pj to the register \a) 
whenever Algorithm 2 measures a violation, and simply redefining which qudits of |a) future 
measurements Ilj act on instead of tracing out the old ones, the set of qudits on which the 
projector Pt acts depends on the violations Pj<t that occurred prior to Pt.) 

Let |r2) = eg) Iw) = \i)\i) /d)®'" denote the maximally entangled state between \a) and 
1/3). Assume Algorithm 2 measures a violation Pt = Il{v) (acting on the appropriate qudits of 

^So far, this is identical to the argument in [Moser and Tardos, 2010, Lemma 2.1]. However, between drawing 
qudits from Pi and measuring Tl{v) on them, Algorithm 2 may have performed arbitrarily many "satisfied" 
measurements. 
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the growing \a) register) in the tth measurement. The (unnormaUsed) state of |a,/3) after this 
measurement is then given by 

\a, /3) = [U{v)Pt^i • • • P2P1 1] \n) (9) 

What is probabihty that r-check "fails" when it performs the corresponding Il{v)^ measure- 
ment, given that r occurs in the log? We know from above that when r-check draws qudits 
from to measure n(f)^, it obtains precisely those qudits from |/3) that correspond to the 
qudits in \a) on which Il{v) acts. Now, r-check outputs "fail" if it measures 1 — Il{v)'^ , so the 



probability of this occurring is given by 

Tr[l®(l-n(i;)^)|a,/3)(a,/3|] (10a) 

= {a, f3\ 10(1- U{vf)\a,p) (10b) 

oc {n\ [P1P2 • • • Pt^iU{v)Pt^i ■ ■ ■ P2P1] ® (1 - niv f) \n) (loc) 

= {n\ [P1P2 ■ ■ ■ Pt-i] ® [(1 - ii{v)'^)p]; p^ ■ ■ ■ Pliii{vf] \^) (lod) 

= (Q| [P1P2 ■ ■ ■ Pt-i] ® [PiPl ■ ■ ■ Pt^i{l - Tl{v)^)Yi{v)'^] \n) (lOe) 

= 0, (lOf) 

where Eq. (10c) is only a proportionality because we have not normalised the state, and in 

Eq. (lOd) we have used the property of the maximally entangled state A®\ \uj) = 1 Cg) A'- \uj), 



which holds for any operator A. In Eq. (10c), we have used the fact that all the {Pi} commute 
with n(?;), which follows immediately from the fact that {Hj} commute, by assumption. 

This holds for any violation n(u), and any sequence of measurement outcomes Pj<f prior 
to Pt = n(w). Therefore, given that r occurs in Algorithm 2's log, the probability of r-check 
"failing" on any of its measurements is zero, which proves the lemma. □ 

If we trace out r-check, then the states drawn from Pi are maximally mixed, as required in 
Algorithm 2. Similarly, if we trace out the algorithm, then the states drawn from Qi by r-check 
are maximally mixed as required in Algorithm 3, so Lemma 9 still holds when the random 
sources are entangled. Thus, as in the analogous classical proof of Moser and Tardos [2010], 
Lemmas 9 and 10 together prove part (ii). □ 

3.2 Expected number of violations 

Having shown in Lemma 8 that a quantum version of Lemma 2.1 from Moser and Tardos [2010] 
holds, the remainder of the argument in Moser and Tardos [2010] goes through unchanged. We 
repeat the argument here for completeness. 

In order to bound the probability of a tree r occurring in the log given in part (ii) of 
Lemma 8, Moser and Tardos [2010] relate it to the probability of the following Galton- Watson 
branching process generating a proper witness tree r^ whose root vertex is labelled by some 
fixed projector 11^. In the first iteration, the process produces a root vertex labelled by 11^. 
In subsequent iterations, the process considers each vertex added in the previous iteration 
independently. For vertex v labelled by Ilj, it considers each projector Ilj in the set r"'"(nj) of 
projectors that intersect with Ilj (including Ilj itself). Independently for each such Ilj S r^(nj), 
it chooses at random whether to add a child vertex labelled by Ilj below v with probability Xj, 
or whether to skip lij with probability 1 — Xj. Thus if all the Ilj are skipped, the branch dies 
out at V. The process continues until all branches die out. (Depending on the probabilities, the 
process could of course continue indefinitely.) 

Lemma 3.1 of Moser and Tardos [2010] computes the probability that this process produces 
a given tree r^ using standard techniques for Galton- Watson branching processes. We restate 
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the lemma here for convenience. For notational simpUcity, let 



Xj^ — "^i 



• na-^.)- (11) 

njGr(n,) 



Lemma 11 (Moser and Tardos [2010]) Let Ta he a given proper witness tree whose root 
vertex is labelled by Ha- The probability that the Galton- Watson process described above produces 
the tree Ta is 

Pr(ra) = ^ n (12) 

(where the product is over all vertex labels in Ta, including repetitions). 

Using this, we can bound the expected number of violations seen by Algorithm 2. Note that 
this bound is independent of the number of iterations in Algorithm 2. The expected number of 
violations is constant even if we run Algorithm 2 forever. 

Theorem 12 For any set of mutually- commuting {Hi} satisfying the Lovdsz conditions of Def- 
inition 6, the expected number of violations seen by Algorithm 2 is bounded by 

m 

E(total number of violations) < — . (13) 

. -I 1 Xi 

1=1 

Proof Let Na be the number of times a given projector XIq appears in Algorithm 2's log, which 
is the same as the number of times Ha is measured to be violated. Let Ta denote all proper 
witness trees whose root vertex is labelled by XIq. Then, from Lemma 8, we have 

E{Na) = Pr(T appears in the log) < ^ JJPr[n(z;)]. (14) 

Now, the probability Prpj] of a projector being violated on a random state is just given by 
its relative dimension Pr(nj) = R{Ili). Since by assumption the projectors satisfy the Lovasz 
conditions (Definition 6), the relative dimension satisfies R(Jli) < x'-. Thus 



T 



the final inequality following from Lemma 11, Pr(ra) being the probability of the Galton- Watson 
process of Lemma 11 generating tree Ta- Since that process either produces a proper witness 
tree in Ta with root vertex labelled by Ha, or continues indefinitely, we have ^^-^-y Pr(T) < 1, 
thus 

E(A^.) < Yl ^ T^' (16) 

1 - a^a ^ l-Xa 

T&Ta 

and the theorem follows from summing over all projectors. □ 



3.3 An efficient quantum algorithm 

We are now in a position to prove Theorem 7: a constructive version of the commutative 
Quantum Lovasz Local Lemma. From the bound on the expected number of violations seen 
by Algorithm 2 already proven in the previous section, it is easy to prove the existence part of 
the commutative Quantum Lovasz Local Lemma (i.e. that there exists a state satisfying all the 
constraints) . 
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Proof (of existence part of Theorem 7) The theorem asserts existence of a state p such 
that Vi : Trpj/o] = 0, when {Ilj} is a set of projectors satisfying the Lovasz conditions. Assume 
for contradiction that no such state exists. Then we have 

5 := min — V TrlUip] > 0. (17) 
p m ^-^ 

i 

But ^ ^ • Tr[np] is just the probabihty of a randomly chosen projector being violated by state 
p, so (5 is a lower bound on the probability of seeing a violation in one iteration of Algorithm 2. 
Therefore, the expected number of violations after running the algorithm for t iterations is 
lower-bounded by t5. For sufficiently large t, this leads to a contradiction with Theorem 12. □ 

However, this argument is insufficient to prove the second part of Theorem 7: that there 
exists an efficient algorithm for approximating this state. To prove this, we construct a modified 
version of Algorithm 2, described in Algorithm 4, which essentially runs Algorithm 2 for a 
random number of iterations. 

Algorithm 4 Quantum converger 
1: function converge_qlll((ni, 112, • • • ) ^m)) 
2: Pick an integer < r < t uniformly at random; 
3: Initialise the assignment register as in Algorithm 2; 
4: i ^ 0; 
5: loop t times 
6: if i < T then 

7: Apply one iteration of Algorithm 2's main loop; 

8: f ^ « + 1; 

9: end if 

10: end loop 

11; return assignment register; 
12: end function 



The following general theorem shows that the time required for Algorithm 4 to converge to 
the desired state is directly related to the expected number of violations seen by Algorithm 2. 
(Note that this theorem does not require commuting projectors.) 

Theorem 13 For an arbitrary set of m projectors {11,,}, let E he the expected number of vi- 
olations seen when Algorithm 2 is run with those projectors. If E is finite, then the state p 
produced by running Algorithm 4 for time t = inE/e satisfies \/i : Trpj/j] < e. 

Proof Let pr denote the final state of Algorithm 4's assignment register given that it picked 
the value r. Note that this is the same as the state of Algorithm 2's assignment register after 
the t"^^ iteration. Since r is chosen at random, and we don't learn its value, the final state of 
Algorithm 4's assignment register is described by the density matrix 

^ r=0 

Since p-r is the same as the state of Algorithm 2's assignment register after r iterations, the 
quantity ^ Tr[njpT-] is equal to the probability of seeing a violation in the r + 1**^ iteration 
of Algorithm 2. So the expected number of violations when Algorithm 2 runs for 1 iterations 
is given by 

E = Y^ Pr(violation in t^^ iteration) = V f — V Trpip^l^ . (19) 
^0 ^oV"^ i / 
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This together with Eq. (18) impHes 



m ^—^ m ^—^ t t V m ^—^ / t 

■i i' n n n M ^ i' ' 



(20) 



r=0 T=0 

Since Trpjp] are positive, each term must be bounded by 

mE 



Tr[Uip] < 



t 



thus running for time t = mE/e gives the desired bound Trp^p] < e. 



(21) 

□ 



For commuting {Ilj}, we have a bound E < ^iXi/{l — Xj) from Theorem 12. Thus Theo- 
rems 12 and 13 together prove an efficient algorithm in the commutative case. Accounting for all 
parameters in the run-time analysis, note that Algorithm 2 requires 0{n) time to generate the 
initial assignment, and 0(|[i]|) time to resample \ [i]\ qudits for each iteration in which projector 
Ilj was violated. Thus, if we run Algorithm 4 for t = fn/ e^-Xi/ [1 — Xi) iterations, taking a 
total time of 



O n + 



m 

m \ - Xi 



1=1 



(22) 



then Theorem 13 guarantees that the resulting state p will satisfy Vi : Trpj/?] < e. 

To see that this is equivalent to the condition Tr[Po/3] ^ 1 — where Pq is the projector onto 
the subspace span{|^) : Villj {ip) = 0}, note that for commuting projectors Pq = 1 — YYlLi ^i- 
Thus 



Tr[Po/o] = Tr 



1=1 



1 - Tr 



i=l 



> 1 - Tr[n,p] > 1 - e. 



(23) 



where the second equality relies of the fact that the {Ilj} mutually commute. 

Thus the state p fulfils all the requirements of Theorem 7, thereby proving the constructive 
commutative Quantum Lovasz Local Lemma. 



3.4 An exact efficient quantum algorithm 

The algorithm of the previous section constructs in polyomial-time a state that is e-close to 
the one whose existence is asserted by the QLLL. This is the natural behaviour to demand of 
a constructive algorithm in the quantum setting. As we will see in Section 6, it generalises 
straightforwardly to the non-commutative case. 

However, Moser [2009] and Moser and Tardos [2010] express the classical result slightly 
differently. They prove that their algorithm constructs the desired state exactly, in polynomial 
expected time.^ It is not reasonable to demand this in the general quantum setting. But in the 
case of commuting projectors, this is a valid and equivalent way to formulate the constructive 
commutative QLLL. In this section, we use the results of the previous sections to give an 
algorithm that constructs the desired state exactly, with polynomial expected run-time. 

The subspace of states fulfilling the QLLL requirement Vz : Ilj |^/^) = can be characterised 
as the intersection of the supports (coimages) of the projectors 1 — Ilj. Let 5j be the support of 
Pj = 1 — Ilj. Then the desired subspace Sq is S = f]™ Si, and Pq is exactly the projector onto 
Sq. Given any state p with non-zero overlap with Pq, i.e. Tr[Po/o] > 0, we can construct a state 
in Pq by projecting onto Pq: PqpPq/ Tv{Pqp). For commuting projectors Ilj, Pq = YViLii'^ — Hj). 
This motivates Algorithm 5, a slight modification of Algorithm 2 where we fix an order in which 
to apply the projectors. We immediately observe the following: 

^Moser and Tardos [2010] also prove a deterministic version. 
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Lemma 14 Let PqpPq he the unnormalised projection of p onto the subspace of states satisfying 
the QLLL requirement \/i : = 0. Consider a run of Algorithm 5, starting from an initial 

state p with Tr[PQp] > 0. If no measurement has been violated throughout the entire run, then 
the assignment register contains the state TpT, where T = Pm ■ Pm-i • • • P2 ■ Pi- 



Algorithm 5 Exact Commutative QLLL solver 



procedure solve_qlll((ni, 112, • • • 1 11^)) 
for all X G X do 

\ai)^P 
end 



loop until c = m or for a maximum of (m + l){pm' + 1) iterations 
Pick the next projector Ilj according to some fixed order; 
Measure Ilj; 

if the projector was violated then 
c ^ 0; 

Append Ilj to the execution log; 
for all X G [i] do 

\ai) ^ P; 
end 
else 

c ^ c + 1; 
end if 
end loop 
if c = m then 

Return "success" 
else 

Return "failure" 
end if 
end procedure 



Quantum mechanics does not allow us to apply projector T = Pm ■ Pm-i ■ ■ ■ P2 ■ Pi deter- 
ministically: some measurements will be violated with positive probability. But we know from 
Theorem 12, which applies equally well to Algorithm 5, that the expected number of violations 
is constant no matter how many iterations are performed in Algorithm 5. We can use this to 
upper-bound the expected number of iterations required to produce one contiguous run of all 
m projectors. 

Lemma 15 For any integer p > 1, the probability that Algorithm 5 returns "success" is at least 
1 — 1/p. In this case, the assignment register is in state T pT / Tt\T pT], where T is the desired 
sequence of projections. 

Proof From Theorem 12, we know that the expected number of violations is bounded from 
above by m' = X]ja^i/(1 — xi). Let M be the random variable counting the total number of 
violations. By Markov's inequality, Pr[M < pm'] > 1 — 1/p. Thus, running the algorithm 
for (m + l){pm' + 1) iterations, we will see with probability 1 — 1/p at most pm' undesired 
projections. 

These pm' projections partition the execution log of length {m + l)(p?Ti' + 1) into at most 
pm' + 1 stretches of desired projections. Thus at least one run of length m consisting exclusively 
of the desired projections must occur within any run of the algorithm with probability 1 — 1/p. 
The termination condition of Algorithm 5 ensures that it halts after the first occurence of such 
a run. □ 
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Combining Lemmas 14 and 15, we see that the algorithm must perform 0{pnim') iterations 
of which at most pm' lead to a resampling of p YliLi " | W | qudits, thus leading to an overall 
effort of 

o(^n+pmp^^^.\[i]\^ , (24) 

where n reflects the cost for sampling the initial n random qudits. Note that we can easily 
recover Theorem 7 from this, by defining p = 1/e. 

Assume we repeatedly run Algorithm 5 until it suceeds. By fixing p > 1 to be some constant, 
we know from Lemma 15 that in each iteration Algorithm 5 succeeds with probability 1 — 1/p, 
after which it stops, or fails with probability 1 /p leading to another iteration. Thus this is a pro- 
cess of repeated Bernoulli trials until the first success with a geometric distribution Pr(X = k) = 
{l/p){l — l/p)^~^^ and expected termination time E(X) = p. Thus, this process has an overall 
expected run-time of 

o(^n + m^^^.|H|^ , (25) 

and terminates only when it has applied a contiguous run of all m mutually-commuting projec- 
tors, which constructs the desired state exactly. 



4 A Combinatorial Proof 

In this section, we give an alternative proof of the key Theorem 12, which bounded the expected 
number of violations seen by Algorithm 2. In Section 3, we proved Theorem 12 using a quantum 
coupling argument. Here, we replace the coupling argument with a combinatorial one. The 
resulting proof is substantially more involved than the proof based on coupling, which attests 
to the power of even the simple quantum coupling argument used in Section 3. However, like 
many such proofs, the quantum coupling argument may look a little like "black magic". The 
alternative proof described here uses straightforward linear algebra and a slight generalisation of 
witness trees to directed acyclic graphs (DAG) to arrive at the same bound, providing different 
insight into how the QLLL algorithm works. 

Moser and Tardos [2010] prove the constructive LLL by bounding the probability of observ- 
ing a given sequence of violated events in Algorithm 1. The quantum generalisation of this 
type of algorithm is a quantum stochastic process generated by iterated measurement. Here, 
we prove a number of results concerning iterated quantum measurement processes, which may 
be of independent interest. We then re-prove Theorem 12 using these results. 



4.1 A Simple Iterated Measurement Process 

As a warm up, let us start with the following quantum stochastic process, which will serve as a 
building block for later, more complex processes. 

Lemma 16 Let {Hj} be a finite set of commuting projectors on a C"'. Consider the quantum 
stochastic process produced by repeatedly picking one of the projectors uniformly at random, and 
performing the two-outcome measurement {Hi, 1 — Hj}, starting from the maximally mixed state 
1/d, until a measurement gives the H outcome, whereupon the process halts. 

Let Pa be the final state of the system given that the process halted on outcome Ha, andpa the 
probability that this occurs. Then the unnormalised density operator Xa = PaPa corresponding 
to this outcome satisfies the operator inequality 

Xa = PaPa < ^a/d, (26) 

hence 

Pa < Tv[Ua/d\. (27) 
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Proof Let m be the total number of projectors, so that in each iteration there is a probability 
1/m of picking a given projector. The generalised measurement describing the different possible 
outcomes in a given iteration is described by a collection of trace non-increasing CP maps 
{Aik}, such that X^^A^fe is trace-preserving. To obtain the outcome Ha in a given iteration, 
we must pick the projector Ua in that iteration and also obtain the outcome Ha in the resulting 
{Ilj, 1 — Ilj} measurement. This corresponds to the measurement element ^Aia{p) = ^HapHa- 
In order to have reached that iteration, we must have obtained one of the 1 — 11 outcomes 
in all previous iterations, otherwise the process would already have halted. The measurement 
element corresponding to the process not halting at a particular iteration is therefore given by 
-A4cont(p) = ^(1 — H.i)p{l — Hi). Thus the unnormalised density operator produced by the 
process halting with outcome Ha in the t + 1*^ iteration is described by 

—Ma O Meant O'-'O A^cont(p) = —Ma o (28) 

m V ' rn 

t 

The process can halt after any number of iterations, so the overall unnormalised density operator 
corresponding to the process eventually halting on outcome Ha is 



1 °° 1 1 

Xa = -J2Y1 — - n^J •••(!- Hi,)- (1 - Hi,) ■■■{!- Hi,)Ha (29a) 

t=0 io,...,it 

1 °° 1 1 

= -E E — ^a{t-HiJ ■■■{!- Hi,)- {l-Hi,)---{l-Hi,)Ha (29b) 



m ^ — ' ^ — ' 

t=0 io,...,ii^a 



^-E E ^n^n, (29c) 

m ^-^ ^-^ , rrv' a 

t=0 io,...,itj^a 

= -E n,-n, 29d 

t=o ^ ' 

= Hald (29e) 

as claimed. The equality in Eq. (29b) comes from commuting Ha all the way through the 
product of (1 — nj)'s, killing all terms in the sum containing any (1 — Hg) factor. Thus the sum 
is only over the remaining (m — 1)* terms. The inequality follows from the fact that, since all 
the (1 — nj)'s commute, 



(1 - n,j • • • (1 - n,„)l (1 - Hi,) . . . (1 - n,j < 1. (30) 



The final part of the lemma follows immediately from the fact that the probability of the 
process halting on outcome Ha is given by Tr[Xa]. □ 



4.2 Resample DAGs 

We now turn to a more complicated quantum stochastic process that produces an infinite 
sequence of measurement outcomes. For this we require some basic definitions. 

Definition 17 (Resample DAG) Let {Hi\ he a finite set of projectors acting on a tensor 
product space ^il-Li, and let Ha^^Ha^^ ■ ■ ■ he a sequence of projectors chosen from this set. 
The resample DAG 'S{Hax-,Ha2-, ■ ■ ■ jHaJ of such a sequence is the directed acyclic graph (DAG) 
with vertices lahelled hy sequence elements, and a directed edge from IXq. to Haj iff the two 
projectors intersect and Ha. occurs before Haj. (Thus the resample DAG encodes the partial 
ordering of the sequence with respect to projector intersections.) 
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(Note that this is the same as the "resample graph" defined by Kolipaka and Szegedy [2011] 
for the classical setting.) ©(Ila^jllaj, ■ ■ ■ ,^ai) can be constructed from the sequence as follows. 
Work backwards through the sequence, starting with the final projector Har For each projector 
Ha^ , find the set Li of all vertices labelled by projectors that intersect with Xla^ (which may be 
empty). Add a new vertex, labelling it by Ha^, and create directed edges from each element of 
Li to the new vertex. 

Definition 18 (DAG Probability) Let {Hi} be a finite set of projectors acting on a tensor 
product space (^iHi, and let & be a resample DAG over these projectors. The DAG probability 
with respect to & of the sequence , Ii^a2 , ■ ■ ■ , , denoted {Hai , , • • • , 11^; ) , is the proba- 
bility that the following process generates the sequence. Starting with the resample DAG & and 
an empty initial sequence, pick one of the leaf vertices uniformly at random and append its label 
to the end of the sequence, removing the vertex from the DAG. Repeat this procedure until no 
vertices are left. 



4.3 A More Complicated Iterated Measurement Process 

We are now in a position to study a more complicated quantum stochastic process, on which 
our constructive QLLL algorithm will ultimately be based. 

Lemma 19 Let {Hi} be a finite set of commuting projectors on an n-qudit Hilbert space % 
with total dimension d. Consider the quantum stochastic process which starts from the maximally 
mixed state runs the process of Lemma 16 until it halts on some measurement outcome Ha, 
then applies the trace-preserving CP map 

£:„(/.) = Trr,i(p)^iM, (31) 



which reinitialises the qudits measured by Ha to the maximally mixed state, and repeats these two 
steps indefinitely. Here, [a] denotes the subset of qudits on which projector Ha acts non-trivially, 
and d^a] denotes the Hilbert space dimension of that subset. 

Let Xa. ,a2,...,at denote the unnormalised density operator corresponding to obtaining an out- 
come in which the first t iterations of the Lemma 16 sub-process halted in turn on the sequence 
of measurement outcomes Ha^.,Ha2-, ■ ■ ■ jHa^. Then Xa^,a2,...,at satisfies the operator inequality 

* 1 

Xa, ,a2,...,at 

< PC (n„, , n,, , . . . , J J] Tr [n,^ /d] • - , (32) 

i=l 

and the probability of this occurring is bounded by 

t 

Fi{Ha, ,Ha2,...,Ha,)<p@ {Ha, , H,, , . . . , J J] T^^ [Ha, /d] , (33) 



i=l 



where the resample DAG & = &{Haj^,Ha2, ■ ■ ■ ,Hat). 



Before proving this lemma, we first establish some basic identities concerning compositions 
of the various maps involved in Lemmas 16 and 19. 

Lemma 20 Let the CP maps M.a, -Accent o,nd £a be defined as in Lemmas 16 and 19: 

Ma{p)=HapHa, (34) 



M 



cont 



{p) = Y.-{l-Hi)p{l-Hi), (35) 

i 

£,{p)=Tr[a]ip)®^. (36) 
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In addition, for disjoint Ha^^b; define the CP maps 

oo 

A^s = J]A^*ont, (37) 

t=0 

Ma,b{p) = {Ha » nb)p{Ua Hfe), (38) 

£a,b{p) = £a o £b{p) = Tr[,]uM(p) ® (39) 

"[a]U[fe] 

Recall that all projectors Hi in these definitions are assumed to commute. The following relations 
hold: 

(i) - ^Mai,...,ak°Mj^{l/d) < i7Wai,...,aJl/(i) for disjoint Ha,, 

(ii) . £aoMa{l/d)=TT[Ua/d]]. 

d 

(Hi). Ma = Ma o Ma, 

(iv) . MaoMb = MboMa= Ma,b for disjoint Ua, U^, 

( v) . Sa° £b = £b° £a = £a,b for disjoint Ua, Ub, 

( vi) . Ma o£b = £b° Ma for disjoint Ha, Hb, 

(vii) . Ma o Mj: = Mj: o Ma, 

(When we write M < N for CP maps M,M, we mean that M{X) < M{X) for any positive 
operator X.) 

Proof To prove Part (i) , we start by applying almost exactly the same argument as in the proof 
of Lemma 16, replacing Hq with (^f=x and noting that this tensor product of projectors 
kills k terms from the sum in Eq. (29b), rather than just the one in the proof of Lemma 16. 
Thus 

k 



Ma,,...,a,oMl,At/d)<^Y. E i((2)na.)M®n- 

t=0 io,...,- 

' m — 



m ^-^ ^-^ mJ- \ ^— ^ / " , , 

t=0 'to,...,Jt5^a \i=\ / \i=l / ^^Q^ 



^ , Ma,,...^a,{\ld) 



and 



1 °° 

-Ma,,...,a, o MT.(\ld) = -Y, Ma,,...,a, o Ml^,{l/d) (41a) 

m m 

s^EO^)x« 'M") (41b) 

t=0 ^ ^ 

= \Ma,,...,a,i\ld). (41c) 

Part (ii) follows immediately from the definitions of Ea and Ma- Part (iii) is immediate 
from = Ha. Parts (iv) and (v) follow trivially from the fact that two disjoint projectors by 
definition only act non-trivially on different subsystems. Similarly, in Part (vi), since Ha and 

are disjoint, the partial trace in Eb is over a subsystem on which the projector Hq in Ma 
acts trivially. Finally, Part (vii) follows from the fact that all the 11, commute. □ 



19 



We are now in a position to prove Lemma 19. 

Proof (of Lemma 19) The unnormalised density operator ^ria^, Hag, ■■■,1104 corresponding to 
the process of Lemma 19 producing the sequence , , • • • , is given by 

Xnaj^,na^,...,Uat 

= £a, o ^^Ma, o A^E o £a,^, o ^Ma,_, o o ■ ■ ■ o o l^Ma, o M^il/d). (42) 

We start by using Part (iii) of Lemma 20 to duplicate every Mai which can be commuted 
all the way through the expression to right using Parts (iv), (vi) and (vii). This will produce 
a string of Mai's of the form Mai ° o ■ ■ ■ o Mai next to the inner-most Mai- Since each 
Maj is preceded by a Sa, in Eq. (42), and Mai °^iy be commuted through Sa^ if and 
Ha are disjoint, all Mai's in this string necessarily have disjoint Ha., so we can combine them 
into a single Ma^,aj,...,ai using Part (iv) of Lemma 20. 

At this point, the expression has the form 

= £a, o ^Ma, o M^ o £^^_^ o ^Ma,_i o Mj: o ■ ■ ■ o o ^Ma,,a„...,ai o M^{l/d). (43) 

We can now use Part (i) of Lemma 20 to eliminate the inner-most Ms: 

Sat ° ^Mai o Ms o £^^_^ o ^Ma,_i o Ms o ■ ■ ■ o o ^Ma,,a, ,...,ai o Ms{l/d) 

< £a, o ^Ma, o Ms o £ai_i o ^Ma.^i o A^s O • • • O ^T,^ o lMa,,aj,...,ai{l/d), (44) 

where k is the number of Hq^'s in the Mai,aj,...,ai- 

Now we reverse the above steps. First we separate out Mai,aj....,ai again into Mai ° -^"j ° 
■■■oM ai using Part (iv) of Lemma 20. Then we use Parts (iv), (vi) and (vii) to move all 
the separated Mai (except Ma^ back through the expression to the left, until they can be 
recombined using Part (iii) with the Mai that they were originally duplicated from. We are left 
with 

^n.„n.„...,n., < ^a, o ^Ma^ o Ms o £ai^i o ^A^a,_i oMso---o£,^o \Mai{\ld). (45) 
We now use Part (ii) of Lemma 20 to arrive at 

< ^ Tr[n,7d] • £a, o ^Ma, o Ms o £a^-i o ^Ma.^i o A^E o • • • o o ^Ma^ o Ms^^jd). 

(46) 

Comparing with Eq. (42), we see that, in this way, we have eliminated the inner-most £ai o 
^Mai ° Ms terms from the expression, picking up a scalar factor of l/Zc in the process which 
depends on how many Mai could be commuted through far enough to reach the inner-most Mai ■ 
The sequence of CP maps on the right hand side of Eq. (46) now has exactly the same 
form as the original Eq. (42), but shortened by omiting the oi terms. Thus we have obtained 
a recursive operator inequality for XYia^,Ua2,---,Tiat- 

^n.„n.„...,n„, < ^ Tr[n,^/d] • Xn„,,...,n„, ■ (47) 

Applying this inequality recursively, we finally arrive at 

1 * 1 
^n.„n.„...,n„, <:^-nTr[n.p]-;., (48) 
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with a yet-to-be-determined combinatorial coefficient 1/V arising from tfie 1/k coefficients in- 
troduced by Part (i) of Lemma 20. 

It remains to determine tlie combinatorial coefficient 1/V. This is a product of all the 1/k 
coefficients contributed by using Part (i) of Lemma 20 to eliminate an A^s- Now, each A^s is 
eliminated by a map of the form J^aa,ap,... (cf- Eq. (44)). Let S = {Ila^,Ilaj■, ■ ■ ■} be the set 
of elements from the original sequence Ila^^I\.a2^ ■ ■ ■ ^^at that are involved in the map TWa^.a^,... 
that eliminates the A^s in question. Then the elimination of Afs by Maa,a0,... contributes a 
factor of l/l^l. Thus, to understand the combinatorial coefficient 1/V, it suffices to understand 
all the sets S. It will be useful in what follows to consider both the subset S of elements from the 
original sequence, and the set of vertices V{S) from the resample DAG 6 = &{Ilai , , • • • , 
that are labelled by the elements of S. 

Each A^E appears immediately to the right of a particular Mai ™ original expression of 
Eq. (42), which in turn corresponds to a particular element IIq- of the original sequence. Thus 
each Als can be identified with a unique element 11^. of the sequence. Consider a particular 
Als, and let 11^^ be the element of the original sequence with which it is identified. Now, the 
map M.aa,a0,... that eliminates this Als is made up of all those Maiys lyi^ig to the left of Mas 
in the original expression Eq. (42) which could be commuted past all the intervening maps 
using Parts (iv), (vi) and (vii) of Lemma 20. Note that AIq. can only be moved past a £a^ if 
Ila- is disjoint from 11^^. Thus we can determine the set S corresponding to the Afaco^,... that 
eliminates a given Afs just by looking at the original sequence alone. 

S can be found by the following procedure, which simply mirrors on the level of the sequence 
the steps involved in eliminating the Als- (Cf. the reduction of Eq. (42) to Eq. (46).) Start 
with the final element of the sequence,^ and move it as far as possible towards the beginning 
of the sequence, subject to the rule that it can only be moved past an element if it is disjoint 
from that element. Repeat this procedure once'^ for each element Hi^s that appears later in the 
sequence than Ila^, starting from the final element Hat working backwards through Hai^s until 
Ila^ . S is then the set of all elements that have been moved to the left of 11^^ , including 11^^ 
itself (but not including any elements 11^.^^, which were never moved). 

How do the elements of S relate to vertices in the resample DAG (5? If any descendant of 
Hai appears in between 11^^ and Ha. in the sequence, then IIq. cannot be in S. To see this, note 
that by Definition 17 all ancestors of an element must appear later in the sequence than that 
element.'^ Therefore, if 11^^. is a descendant of 11^., then there is at least one child (possibly 11^^. 
itself) of Hq- in between 11^^ and Uai- But by Definition 17 n^. intersects with all its children, 
so it cannot be moved past this child. 

Thus no element of S has any descendant appearing between it and 11^^. But, as we've 
already noted, any descendant of an element must appear earlier in the sequence than that 
element. Therefore, no element of S has any descendants appearing anywhere after 11^^ in the 
sequence. Conversely, if none of the descendents of an element 11^. appear after in the 
sequence, then Hq,. is in S. This is because the only thing that could stop 11^. from being 
moved past Ila^ is a projector with which it intersects appearing between it and 11^^, but by 
Definition 17 any such projector will be a descendant of Hq. . 

S is therefore the set of all elements appearing after 11^^ in the sequence which have 
no descendants amongst the elements n^j^^. In other words, if we consider the resample 
DAG (5s(na^ , Hfl^^-^ , . . . , IlaJ for the subsequence Ila^ , 11^^^-^ , . . . , 11^^ , then the elements of 
S are the leaf vertices of Note that &s can be obtained from the full resample DAG 
&{Ilas , 'nas+i , ■ ■ ■ , HaJ by removing the vertices labelled by 11^^ , . . . , 11^^^^ . 

Recall that the elimination of the Als corresponding to 11^^ contributes a factor 1/\S\ to 

^Recall that the maps Ala; in the composition of Eq. (42) are in reverse order to the corresponding elements 
Jla; in the sequence. 

^Once we have finished moving a particular element, we are not allowed to move it again later, even if 
subsequent moves have "unblocked" it. 

■^Recall from Definition 17 that the resample DAG is constructed by working backwards through the sequence. 
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the combinatorial coefficient 1/7^, and we have seen that 5 is the set of all leaves remaining 
in the resample DAG after vertices labelled by elements preceding Has have been removed. So 
1/\S\ is just the probability of removing Hq^ when a leaf is picked uniformly at random and 
removed, given that vertices labelled by elements preceding have already been removed 
from the resample DAG. There is one such factor 1/\S\ contributed by each element Has of the 
sequence, so the overall combinatorial coefficient 1/V in Eq. (48), which is the product of all 
these 1/\S\ factors, is precisely the DAG probability of Definition 18. This completes the proof 
of Eq. (32). 

The probability bound of Eq. (33) follows immediately from Eq. (32) and the fact that the 
probability of the obtaining the sequence corresponding to the unnormalised density operator 

Xai,a2,...,at is given by Tr[Xai,a2,...,at]- □ 

Lemma 19 immediately implies a simple bound on the probability of the measurement 
outcomes from the Lemma 19 process forming a given resample DAG. 

Corollary 21 Let G be a fixed resample DAG with vertices v G V{G) labelled by H^, and let 
Hai , , • • • , Hflj be the first t measurement outcomes produced by the Lemma 19 process. Then 

Pr[(5(n,,,n„„...,n,J = G]< J] Tr[n„p]. (49) 

v&V{G) 

Proof To stand any chance of producing the DAG G, we must have {Ilaj , ■ • ■ , Haj} = {n^j^gyj-g). 
The probability of producing a given resample DAG G is obtained by summing the probabilities 
of all distinct sequences that generate that DAG. Let 

S = {S = lia^,,, , Ha^,,, , . . . , n,^(,, \g{s)= G} (50) 

be the set of all sequences with resample DAG G. Note that this is just the set of all permutations 
a of {n^,} consistent with the partial order encoded by G. 

Now, every sequence s € 5 can be generated with probability pg{s) by running the process 
of Definition 18 on G; conversely, every sequence generated by running that process on G is in 
S. Since the process always generates some sequence, we have 

Y,Pg{s) = 1. (51) 

Thus, summing the probabilities from Eq. (33) of Lemma 19, we have 

t 

Pr [©(n,, , n,, , . . . , = G] = Pr(s) < pg{s) ■ n ^i^a^p] 

s:0{s)=G ses 1=1 (52) 

= n Tr[n„p] • Y.Pg{s) = n Tr[n,p]. 

v&G ses veG 



4.4 Partial Resample DAGs 

Later on, we will need to consider a variant of the resample DAG, constructed almost exactly 
as in Definition 17, but with one key difference. Instead of including all projectors from the 
sequence in the DAG, we only include those that are reachable from the vertex labelled by the 
final projector in the sequence; i.e. any projector that cannot be attached to an existing vertex 
when constructing the DAG (cf. Definition 17) is simply discarded. 

Definition 22 (Partial Resample DAG) Let {Hj} be a finite set of projectors acting on a 
tensor product space (^j'Hi, and let Ilaj^,Ha2, ■ ■ ■ ,na, be a sequence of projectors chosen from 
this set. The partial resample DAG is the subgraph of the resample DAG (5{Ilai,Ha2, ■ ■ ■ ,^ai) 
which consisting of the subset of vertices that are reachable from the vertex labelled by Uai , and 
all edges that begin and end at vertices in this subset. 
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The partial resample DAG 0(naj , Hag , • • • , HaJ can be constructed from the sequence as 
fohows. Start with the final projector XIq; in the sequence. Create a root vertex and label it 
with Har Then, working backwards through the sequence, for each projector Ha,^, find the 
set Li of all vertices labelled by projectors that intersect with If Li is empty, skip this 

projector. Otherwise, create a new vertex, labelling it by Ha^, and attach it to the DAG by 
creating directed edges from each element of Li to the new vertex. 

Definition 23 (Relevant Subsequence) Let , Ilaj , • • • , ^ sequence of projectors, 

and 0(naj,na2, . . . jllaj) the corresponding partial resample DAG. The relevant subsequence 
of Hai 1 , • • • , is the subsequence obtained by discarding from the sequence all elements 
that get discareded when ^(na. , Ilaj, • • • , is constructed according to Definition 24- Note 
that two sequences with the same relevant subsequence necessarily generate the same partial 
resample DAG. 

Definition 24 (Partial DAG Probability) Let {11,} be a finite set of projectors acting on a 
tensor product space Tii, and let g be a partial resample DAG over these projectors. The par- 
tial DAG probability with respect to g of the sequence Ila^ , , • • • , , denoted pg {liai , , • • • , ) , 
is the probability that the following process generates the sequence. Starting with the partial re- 
sample DAG Q and an empty initial sequence, pick one of the leaf vertices uniformly at random 
and append its label to the end of the sequence, removing the vertex from the DA G. Repeat this 
procedure until no vertices are left. 

The analogous result to Corollary 21 also holds for partial resample DAGs. 

Lemma 25 Let g be a fixed partial resample DAG with vertices v G Vig) labelled by projectors 
Hy, and let Ilai,^a2J ■ ■ ■ i^at be the first length-t relevant subsequence of outcomes from the 
Lemma 19 process. Then 



Proof The proof is similar to the proofs of Lemma 19 and Corollary 21, but some of the 
outcomes generated by the Lemma 19 process might not be irrelevant because they get discarded 
when constructing a partial resample DAG. We will first bound the probability of a given 
sequence ITq^ , Ilaj , . . . , Hat forming the first length-f relevant subsequence. We can then sum 
over all relevant subsequences that generate the specified partial resample DAG g to bound the 
overall probability of producing g. 

Previously, the map My. allowed for any number of 1 — 11 measurement outcomes in 
Lemma 19. We must now also allow for any number of irrelevant outcomes. A projector is 
discarded if and only if it does not intersect with any projector that occurs later in the se- 
quence.^ So, between any two elements Ha^_^ and lia^ of the sequence ■, > we can 
have any number of outcomes lix such that lix is disjoint from all Hq^^ . , as well as any number 
of 1 — n outcomes. 

In other words, if Xi is the set of irrelevant outcomes that will be discarded from the DAG if 
they occur between elements Ila- j and sequence, then the generalised measurement 

element corresponding to a single iteration of the Lemma 16 sub-process not producing a relevant 
outcome is given by 



Pr[0(na,,n, 



...,n,j = g]< J] Tr[n,p]. 



(53) 




(54) 



Once again, recall that the partial resample DAG is constructed backwards. 
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where the trace-preserving CP map 

Ta{a) = (1 - n,)a(l - n,) +£aoMa 

= (1 - n,)(j(i - n,) + ® Tr[,](n,an,). 

The map corresponding to any number of irrelevant outcomes occurring is then given by 



(55) 



(56) 



We now prove a version of Part (i) of Lemma 20 for the map 



Lemma 26 Let Yla^jHaj^, ■ ■ ■ be disjoint from each other and from all Tlx £ Xi, and let Trj;^.] 
denote the partial trace over all qudits on which some Tlx G '^i o,cts non-trivially. Then 



Tr 



iMa^,a„...o{M^i^Yil/d)\ < iTr[;,^][A^a.,a„...(l/d)] 



(57) 



Proof First, note that, since n^^, . . . ,Ilai,, are disjoint from each other and from Ilx, we have 



= Tr[^] [Ma^,ap,...{'^)] , 



(58) 



since 7^ is trace- preserving and only acts non-trivially on [X]. 
Now, 



Tr 



Ma^,a„... o (>[S,J*(l/d)' 



Tr 



^-^><^,J^ H (i-n,)(A^(L)*-i(i/d)(i-n,; 



+ - E Ma^,a„...oTao{Ml:l,Y-\l/d) 



■ (59) 



Since Hq^ , , . . . are disjoint from all Hq € A'j, all the terms involving projectors that intersect 
with , Ha^ , ■ ■ ■ occur in the first sum. The first sum can therefore be treated exactly as in 
Eq. (40) from Lemma 20, giving 



Ti' 



1 



Ma^,a„...[- E (l-na)(A^£nt)*^'(l/d)(l-na) 



< Tr 



" --Ma.,a„...o(A^«J*-l(l/d) 



m 



Using Eq. (58), the second sum simplifies to 



Tr 



(«) \t-\ 



cont/ 



(1/d) 



Tr 



m 



cont/' 



(l/d) 



Tr 



(i) \t-l 



cont J 



(l/d) 



(60) 



(61a) 
(61b) 
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Putting these together, we obtain 



Tr 



< Tr 



m — k 



m 



1/d) 



(62) 



By induction on t (the base case t = is trivial) and summing over t, we obtain the identity in 
the lemma. □ 



Proof (of Lemma 25, continued) 

The probability of obtaining JJa^ , Ilaj , . . . , Hat ^'^^ the first t relevant outcomes is given by 



Pr(n„,,n,,,...,n,j 

= Tr 



^£a,ol^Ma,oM'i\p) . (63) 



(Compare with Eq. (42) from Lemma 19. The now also account for outcomes that are 

discarded when constructing the partial resample DAG.) 

We eliminate the M-^ from this expression in much the same way as in Lemma 19, using 
Lemma 26 instead of Lemma 20, Part (i). Since a projector is discarded when constructing the 
DAG if and only if it is disjoint from all projectors that occur later in the sequence, they do not 
aff'ect which A^a/s can be commuted all the way to the right, so iterating the same sequence of 
steps again leads to a recursive operator inequality of the form 

Xn„„n„„...,n., < ^ Tr[UaJd\ • Xn„,,...,n., ■ (64) 

The only difference as compared to the inequality in Eq. (47) lies in the combinatorial factor 
1/k, which now only counts relevant measurement outcomes. 

The combinatorial factors arising from Lemma 26 are otherwise identical to those of Lemma 20, 
Part (i) , leading to an overall factor given this time by the partial DAG probability pg {Hai , , ■ ■ ■ , 
with Q = g(na^,na2, . . . jllaj. Thus we finally end up with the following bound on the prob- 
ability of obtaining Hai , , • • • , as the first t outcomes that are not discarded from the 
partial resample DAG: 

t 

PT{Ua, ,Ua„ . . . ,Uat) < P, (H,, , H,, , . . . , J • J] Tr [n,, p] . (65) 

i=l 

As in Corollary 21, when we sum over all relevant subsequences that generate the specified 
partial resample DAG g, the combinatorial coefficients pg sum to 1 and we arrive at the bound 
claimed in the lemma. □ 



4.5 Expected number of violations 

We are now in a position to re-prove the bound of Theorem 12 on the expected number of 
violations seen by Algorithm 2. Recall that Algorithm 2 keeps a log of violated measurements 
naijllaj, . . . jIIq^, whose only purpose is to aid in the analysis. We say that a given partial 
resample DAG g occurs in Algorithm 2's log if it can be constructed starting from some entry 
of the log, i.e. if for some I Q{Ilai, ■ ■ ■ , = g. 

The process implemented by Algorithm 2 is just the iterated measurement process defined 
in Lemma 19. So Lemma 25 immediately implies the following result: 
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Corollary 27 The probability that the partial resample DAG g occurs in the log is at most 
Wvi^v {g) ■^^■") ' where V{g) is the vertex set of g, n„ is the projector labelling vertex v, and 
the relative dimension R(n.i,) is equivalent to the probability of measuring n„ on the maximally 
mixed state. 

(Though proved by very different means, Corollary 27 is essentially a quantum version of 
Lemma 2.1 from Moser and Tardos [2010], though that Lemma is not quite a special case 
of Corollary 27 as it concerns "witness trees" rather than partial resample DAGs.) 

In order to bound the expected number of violations seen by Algorithm 2, we again follow 
Moser and Tardos [2010] in relating Corollary 27 to a Galton- Watson branching process. Notice 
that the bounds in Corollaries 21 and 27 are determined solely by the set of vertex labels 
appearing in the DAG; the structure of the DAG plays no role. Let r(g) be a (labelled) spanning 
tree for the partial resample DAG g. Because it is the spanning tree for an resample DAG, all 
the children of a vertex in T{g) carry distinct labels (cf. Definition 22). Labelled trees with this 
property will be called proper, irrespective of whether they are spanning trees for some DAG; 
all spanning trees are proper, but not all proper trees are spanning trees. 

We will once again relate the bound in Corollary 27 to the probability that the Galton- 
Watson process of Lemma 11 generates a proper tree Ta whose root vertex is labelled by some 
fixed projector Ua, yielding an alternative proof of Theorem 12. 

Proof (of Theorem 12) Let Na be the number of times a given projector XIq appears in 
Algorithm 2's log, which is the same as the number of times Ha was measured to be violated. 
Let Qa denote the set of all partial resample DAGs whose root vertex is labelled by Ha, and let 
Ta denote the set of all proper trees with root vertex labelled by Ha- Then, from Corollary 27, 
we have 

E{Na) = Fiiga occurs in the log) < ^ H^^^^) 

ga&Ga gaeGa v£V{ga) ,„„, 

(bbj 

= E n^(n,)< n^(n,). 

gaf^Ga veV{T{ga)) Ta^Ta V&V (Ta) 

The final relation holds because distinct resample DAGs have distinct spanning trees. It is an 
inequality for two reasons: a DAG can have multiple spanning trees, leading to double-counting, 
and the set of proper trees is a strict superset of the set of spanning trees. 

Now, by the assumption of the QLLL (Theorem 7), the relative dimension satisfies R{Ili) < 
x[. Thus 

ma)< E n^(n'')^ E n^^^ E T3^p^(^»)' (67) 

the final inequality following from Lemma 11, Pr(ra) being the probability of the Galton- Watson 
process generating tree Tq. Since that process either produces a proper tree in 7^, or continues 
indefinitely, we have ^^^^7-^ Pi'(Ta) < 1, thus 

lE(iVa) < E ^ T^' (68) 

l-Xa ^ 1 - Xa 

TaG la 

and the theorem follows from summing over all projectors. □ 

Using this alternate proof of Theorem 12, the rest of the proof of the constructive commu- 
tative QLLL (Theorem 7) goes through as before using the arguments of Sections 3.3 and 3.4. 
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5 Application: Bounding Convergence Times of CP Maps 



A (time-homogeneous) quantum Markov process is generated by iterating a fixed completely 
positive trace-preserving (CP) map £, so that t steps of the process are described by the com- 
position 

= £ o £ o ■ ■ ■ o (69) 

t 

As in the case of (classical) Markov chains, convergence times of quantum stochastic processes 
are an important field of study, with applications to quantum dynamics [Aharonov et al., 2001, 
Cubitt et al., 2009, Sanz et al., 2009], quantum information theory [Sanz et al., 2009], and 
quantum algorithms [Schwarz et al., 2011, Temme et al., 2011, Verstraete et al., 2009]. 

The results of Section 3 prove that Algorithm 4 converges in polynomial time to a state satis- 
fying the requirements of the commutating QLLL, or equivalently the ground state subspace of 
the associated Hamiltonian. This holds promise as a new technique for proving fast convergence 
of quantum Markov processes to their steady state. However, strictly speaking Algorithm 4 does 
not implement a quantum Markov process. Although each iteration of Algorithm 2, on which it 
is based, does implement a fixed CP map £, the averaging trick used in Algorithm 4 means that 
the latter algorithm does not simply iterate this map. (The overall evolution of Algorithm 4 
for a given total time t can of course be described by some CP map, but this map is not of the 
form £^.) Algorithm 5 is further still from being a quantum Markov process. 

In this section, we instead focus on the quantum Markov process defined by Algorithm 2, and 
prove that if the set of projectors defining the process satisfy the Lovasz conditions of Theorem 7, 
then this process converges quickly to the steady-state subspace (in time polynomial in the 
overlap with the steady-state subspace, and polynomial in the spectral gap of the associated 
Hamiltonian) . 

Proving fast convergence is slightly more involved than in the case of Algorithm 4, as we 
cannot use the averaging trick of Section 3.3, but instead must combine the fact that the overlap 
with the steady-state subspace is monotonic under the CP map, with Poissonian accumulation 
of the probability of measuring a violation, leading to a contradiction with Theorem 12 were 
the convergence rate too slow. 

Theorem 28 For a given set of commuting projectors ni,H2, . . . ,Ilm, ^^.t £ he the CP map 
defined by^ 

^^P^ = ^ - - + Tr/H [^^p] ® (70) 

i 

If the projectors satisfy the Lovasz conditions of Theorem 7, then the Markov process produced 
by iterating this map converges in 0{m/S'^e) iterations to a state p with fidelity Tr[Poyo] > 1 — £ 
with the steady-state subspace of £ (where Pq is the projector onto that subspace, and 6 is the 
spectral gap of H = ^^Y.i nj- 

Note, that according to the definition above the spectral gap 5 = 1/m in the case of m com- 
muting projectors considered here. 

Proof Note that Eq. (70) is precisely the map implemented by one iteration of Algorithm 2. If 
the projectors satisfy the Lovasz conditions, then we know from Theorem 3 that Pq is non-zero. 
Let pt = £^{pq) be the state of Algorithm 2's assignment register at time t. We can decompose 
pt in terms of Pq: 

Pt = PoPtPo + PoPtil - Po) + (1 - Po)ptPo + (1 - Po)pt{l - Po). (71) 

'^This is the same as the map shown in Verstraete et al. [2009] to converge to the ground state of any 
frustration-free Hamiltonian, but with no control on the convergence rate. 
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Note that PqPiPq is an (unnormalised) state supported on Pq, the cross-terms are traceless, and 
(1 — Po)Pi(l ~ Pq) is an (unnormahsed) state supported on 1 — Pq. 

Since PoptPo is supported on the subspace of states that do not violate any projector, it is 
invariant under £, thus 

Tr[Po£ipt)] =Tr[Popt] + Tr[Po£iPoPtil - Po))] + Tr[Po^:((l - Po)ptPo)] 
+ Ti-[Po£iil-Po)ptil-Po))] 
> TviPopt], (72b) 

where the inequaUty follows from dropping non- negative terms. Tr[PoPt] is therefore monotoni- 
cally non-decreasing under £. 

Let a := Tr[PQpt]. The probability of a randomly chosen projector being violated by pt is 
given by ^ X]iTr[njpt]. Since HiPo = by definition, we have 

- V TrlHipt] = - V(Tr[n,PoPt^o] + TT[UiPopt{l - Po)] (73a) 

i i 

+ Tr[n,(l - Po)ptPo] + TV[ni(l - Po)Pi(l - ^'o)]^ 
-y^Tr[ni{l-Po)pt{l-Po)] (73b) 



m 



< Tr[(l - Po)pt] (73c) 
= 1 - Q, (73d) 



where the inequality follows from the fact that Ilj < 1. 
Recall that the spectral gap 6 is defined as 



p : Pop=0 m 

I 



Note that 6 > 0, since 5 = would imply that Trp,/;/'] = for all which is impossible 

except in the trivial case of all Ilj = 0. Then 

- V Tr[Uipt] = - VfTrpiPoPt] + Tv[UiPopt{l - Po)] (75a) 



m ^ — ' m 

i I 



+ Tr[n,(l - Po)ptPo] + Tr[ni(l - Po)pt{l - Po)] 

TT[il-Po)pt]-^Tr 
m ^-^ 



-^\l-Po)pt{l-Po) 



Tr[(l - Po)pt 



(75b) 



> (1 - a) 5. (75c) 

Since a = TrfPoPt] is non-decreasing with this also holds for all Pr<t- 

So, the probability of a randomly chosen projector being violated by pt is upper-bounded by 
1— a, and the probability of a randomly chosen projector being violated by pr<t is lower-bounded 
by (1 — a)5. Imagine that we iterate the process for t iterations. Given that M violations occur, 
what is the probability that one of the violations occurs in the final {t^^) iteration? From 
Eqs. (73) and (75), the worst-case distribution of violations is given by 

Pr(violation in t^^ iteration) oc 1 — a, (76a) 
Pr (violation in r < t^^ iteration) oc (1 — a) 5. (^Ob) 

The probability of a violation in the final iteration is then equivalent to the probability 
of picking the t^^ element, when we pick M from t elements according to the distribution of 
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Eq. (76). By the union bound, this is at most M x Pr(pick t^^ element). Normahsing the 
distribution of Eq. (76), we have 

SO ^ 

Pr (violation in t^^ iteration | M violations) < ^ ^ . (78) 

We have a bound on the expected number of violations from Theorem 12. Using this together 
with Eq. (78), we have 

Pr(violation in t*^ iteration) < "^^ ^ Pr(M violations) = ^iMl < . (79) 

M 

From Eq. (75), we know that Pr(violation in t^^ iteration) > (1 — a) S. Together with Eq. (79), 
this implies 

Therefore, in order to achieve an overlap Tr[Po/Oi] > 1 — e with the steady-state subspace, we 
require t = 0{m/6'^e) iterations. □ 

Since the map £ of Theorem 28 implements exactly the dissipative state engineering map of 
Verstraete et al. [2009], Theorem 28 implies the following result about the convergence of this 
map. 

Corollary 29 Consider the many-body Hamiltonian H = Xll^i ^''^ ^ particles, where each 
local term hi acts on some subset of the particles, and all the hi mutually commute. Let Hi be 
the projector onto the orthogonal complement of the lowest-energy eigenspace of hi. 

If the projectors satisfy the Lovdsz conditions of Theorem 7, then the CP map £ defined in 
Theorem 28 will converge in time 0{m/6^£) to a state with fidelity 1 — e with the ground state 
subspace of H, where 6 is the spectral gap of the Hamiltonian Hji = ^ J2i which the local 

terms are replaced by projectors. 

Note that the convergence time is independent of the number of particles and of their local 
Hilbert space dimension. 



6 The Non-Commutative Case 

In the previous sections, we have overcome the first challenge to proving a constructive QLLL: 
that of efficiently constructing the desired entangled state using only local measurements. To 
prove a constructive version of the general QLLL of Ambainis et al. [2009], we must overcome 
the second challenge: non-commutativity of the projectors, and the concomitant measurement- 
disturbance problem. 

In the non-commutative setting, there is a new parameter that can play a role in the run- 
time of the QLLL algorithm: the spectral gap of the Hamiltonian associated with a QLLL 
instance. 

Definition 30 (QLLL spectral gap) The spectral gap 5 of a set of projectors {Hi} is the 
spectral gap of the associated Hamiltonian^ 

H=-y^Ui, (81) 

i 

^Note our choice of normalisation, which follows that of Ambainis, Kempe, and Sattath [2009], but differs 
from some papers in the literature by a factor of m. 
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i.e. the difference between the smallest and second-smallest eigenvalues (ignoring degeneracies). 

If the projectors satisfy the Lovdsz conditions (Definition 6), then the minimum eigenvalue 
of H is and the spectral gap can he expressed as 



where Pq is the projector on the subspace of states fulfilling the QLLL requirements, i.e. the 
projector onto span{|V') : VzIIj \tp) = 0}. 

Note that Eq. (82) can be interpreted as the probabihty of measuring a randomly chosen pro- 
jector to be violated, minimised over all states in the orthogonal complement of the subspace 
of states that do not violate any projector. 

6.1 Witness trees 

The proof of the key Lemma 8 for the commutative case in Section 3.1, leading to the bound 
in Theorem 12 on the expected number of violations seen by Algorithm 2, made crucial use of 
commutativity of the {Hj}. (Commutativity is also necessary for the alternative combinatorial 
proof given in Section 4.) In principle, Lemma 8 and hence Theorem 12 could still hold in the 
non-commutative case, without necessarily implying the same run-time as in the commutative 
case. We will see in Section 6.3 that, depending on the type of convergence we demand, the 
spectral gap enters naturally in the required run-time in the non-commutative setting, through 
using a bound on the expected number of violations to prove a bound the rate of convergence 
to the desired state. 

However, there are strong indications that the bound in Lemma 8 part (ii) on the probability 
of a witness tree r occurring in Algorithm 2's log no longer holds for non-commutative projectors. 
The proof of Lemma 8 still goes through in the commutative setting if, instead of constructing 
a witness tree by discarding any projectors that do not intersect with one already in the tree, 
we instead include these projectors, thereby constructing a DAG instead of a tree. This still 
encodes the partial ordering of the violations with respect to projector intersections, so the 
coupling argument in Section 3.1 still works, giving the analogous bound on the probability of 
these DAGs occurring in the log (still in the commutative case). Indeed, we proved exactly this 
result for resample DAGs in Corollary 21, by more involved linear algebra and combinatorial 
techniques. 

But the following counter-example shows that this bound on the probability of a given DAG 
occurring in the log no longer holds if the projectors do not commute. 

Example 31 Consider a system of two qubits. Let IIi = |0)(0| (g) 1, 112 = 1 |0)(0|, and 
Hs = \^p){^p\ + |01)(01| + |10)(10|, where = ^/a\00) + Vb\ll). Let r be the graph consisting 
of two (unconnected) vertices labelled by Hi and 112. 

If Corollary 21 or (the generalisation of) Lemma 8 were true in the non-commutative case, 
the probability of r occurring in Algorithm 2's log would be bounded by 



There are two ways r can occur: either the sequence 111,112 occured at the beginning of 
the log, or the same violations occurred in the other order 112, Hi. The probability of these two 
sequences is the same, by symmetry. Direct analytical calculation gives 



which violates the bound in Eq. (83) for a > ^(^41 — 1), with the probability tending to ^ 




(82) 



Pr(T) < Tr(ni/d)Tr(n2/d) = 1/4. 



(83) 




(84) 



as a — > 1 (a violation of j^)- 
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6.1.1 A Conjecture 

Nevertheless, we conjecture that a similar bound does still hold in general, weakened by a 
polynomial factor. 

Conjecture 1 Let {Hi} be an arbitrary set of projectors, let t be a fixed witness tree with 
vertices labelled by these projectors, and L the log produced by running Algorithm 2 with these 
projectors. We conjecture that the probability that r occurs in L is at most 

Pr(r) <poly(|r|)J]Pr[n(^;)], (85) 
where |r| is the size of t (total number of vertices). 

(We implicitly also allow here a polynomial dependence on the other parameters n, m, d, Xi 
and 1/5. Only the dependence on r plays a role in the subsequent proof; any dependence on 
the other parameters carries straight through to the final run-time.) 

In Section 6.2, we will see that this conjecture relates the probability of a tree occurring in 
the general quantum case to the expectation value of a functional over the total progeny of the 
Galton- Watson process described in Section 3.2. In the classical and commutative cases, we 
saw that this conjecture held for the trivial constant functional poly(|r|) = 1. 

For the simplest case of trees containing a single vertex, we can prove Conjecture 1 even 
without the polynomial factor (i.e. with polydrj) = 1). The single- vertex case is not completely 
trivial, as an arbitrary number of satisfied measurements can occur before Algorithm 2 sees its 
first violation. In fact, we will prove a slightly stronger operator bound, which immediately 
implies the conjecture for single-vertex trees. We already proved this result for commuting 
projectors in Lemma 16. The following proposition generalises that result to arbitrary sets of 
projectors. 

Proposition 32 Let {Ilj} be a set of m projectors on C^. Consider the following iterated 
measurement process, starting from the maximally mixed state. In each step, a projector Hi is 
chosen independently uniformly at random, and the two-outcome measurement {Hi, 1 — Hi} is 
performed. This is repeated until a Hi outcome is obtained, at which point the process halts. 

Let Pa denote the final state of the system given that it halted on outcome Ha, and pa the 
probability that this occurs. Then the (unnormalised) density matrix corresponding to the process 
halting on outcome Ha satisfies 

Xa = PaPa < Ha/d, (86a) 

hence 

Pa = Tv{Xa) < TT{Ha)/d. (86b) 

This result is quite striking. It tells us that we if start from the maximally mixed state and 
perform binary, two-outcome projective measurements at random until we obtain the first "0" 
outcome, we can effectively ignore all intermediate measurements withboutcome "1", and re- 
place this process with that of simply measuring the final outcome on the maximally mixed 
state directly. 

Proof We will in fact prove a slightly more general result. Let denote the Moore-Penrose 
pseudoinverse of a matrix A, and A* denote entry-wise complex conjugation. The operation 
vec{A) or \A) denotes vectorisation of the matrix A, i.e. the treatment of matrix A G MaiG) as 
a vector in C^^. By a slight abuse of notation, we sometimes write \X) < \Y) to mean X <Y. 
We can express the unnormalised density matrix Xa as 

Xa = -Haf2l -(i-nj---(j;-(i-n,jl(i-n,jy..(i-n,j|n„ (87) 

t=0 \ii,...,it \ ii / J 



31 



or, vectorising, 



m ^-^ \ m ^-^ I a 

But this is a Neumann series X^j^q ^* ~ (-"- . Thus the infinite sum can be written as 

\Xa) = -[n*®Uj](i--y^{i-n*)(g){i-Ui)] hi) (89) 
= (n*«)nj) ^^(i<8)ni) + (n*<8)i)-(n*<8)ni)^ (90) 

For a more compact notation, define Q = '^i^i, -Bi = H* Hj, B = ^^Bi, and A = 
X] .(1 (g> Hi + n* (g) 1) = Q* <8) 1 + 1 (8) Q. Then the claim of the Proposition is equivalent to 

Bi{A- B)-^\l) < Bi\l) . (91) 

We will prove a slightly more general result. Let P = Ilj^IIjj ■■■Hif, be a product of k 
commuting projectors chosen from {Hj}, and define Bp = P* ® P. We claim that 



i?p(^-B)-i|i)< Q + ^)spii) 



(92) 



the Proposition being equivalent to the k = 1 case of this. To show this, we need the following 
lemmas, which we prove below. 

Lemma 33 PQ~^P < ^P. 

Lemma 34 A~^ |1) = ^ IQ""^) + \Qi.), where Q± is an operator whose support (coimage) lies 
in the kernel of Q. 

Lemma 35 BpA~^ \1) < ^Bp \1), and in particular BA~'^\1) < \B\1). 
Lemma 36 BpA~^B\l) = \Bp\l), and in particular BA~'^B\1) = \B\1). 
Lemma 37 Let t > 1 be an integer. Then {BA~'^y \1) < ^B\l). 
Eq. (92) now follows from 

Bp{A - B)~^ \1) = Bp{A{l - A-^B))-^ \1) = Bp{l - A-^B)-^A-^ \1) (93a) 

oo oo 

= Bp J2{^~^B)'A-^ \1) = BpA'^ Y,{BA~y \1) (93b) 

t=0 t=0 

oo 

= BpA-^ 1 1) + BpA~^ Y,(P^~y 1 1) (93c) 

t=i 

oo ^ 

< BpA~^\l) + BpA~^^—B\l) (93d) 



t=i 



BpA-^ll) + BpA^^Bll) < 4- -Bp 11) + -Sp 11) (93e) 

2k 2 

5 + 1:) Bp ID. (M) 

where the first equality in Eq. (93a) follows as B is in the support of A, Eq. (93d) from Lemma 37, 
and the inequality in Eq. (93e) from Lemma 35 and Lemma 36. □ 
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We now prove the five lemmas used above. 



Proof (of Lemma 33) First note that 

Y,^t>kni^Ili,---Ui^ (94) 

i 

because 

Y,^i-k ni,ni2 • • • = Ui, + Ui, +Ui^ + J2 nj - kUi.Ui, • • • n^, (95) 

i j^{ii,---,ik} 

> kUi^ Hi, • • • Hi, kUi^ Hi, • • • Hi, (96) 

j^{ii,...,«fe} 

= ^n, >0, (97) 

where Eq. (96) follows from lower bounding each term Ilj^ by Ui^P, which is true for any 
projector. (Here, P = Y[n=i ^ projector since the Ilj^ commute by assumption.) But this 

implies 

(E^O ' - = \P, (98) 

i 

and the lemma follows by left- and right-multiplying by P. □ 

Proof (of Lemma 34) We consider the equation A~'^ |1) = \(t) and solve for the unknown 
\a). By first multiplying with A from the left, we see that AA~^ |1) = A|cr), where AA~^ 
is the projector onto the support of A. Furthermore, note that AA~^ = {QQ~^)* QQ~^, 
thus AA~^ |1) = \QQ~^). Therefore, written in unvectorised form, we have to solve the matrix 
equation Qa + aQ = QQ~^. Clearly the restriction of a to the support of Q can only be ^Q~^, 
but we can add an arbitrary operator in the kernel of Q. □ 

Proof (of Lemma 35) By Lemma 34, BpA'^ |1) = lBp\Q-^) =: \a), using the fact that 
P is in the support of Q. Thus, unvectorised, we have ^PQ~^P = a. But by Lemma 33, 
\PQ~^P < j^P, thus \a) < \P) = j^Bp |1). The second part of the lemma follows simply 
by summing the k = 1 case over i. □ 

Proof (of Lemma 36) First, note that |1) = \Q), thus BpA-^B\l) = BpA'^ \Q). Defining 
\Q) —■ W)j we can multiply with A from the left and solve AA~^ \Q) = A\a) for \a). Now 
note that AA''^ \Q) = [{QQ^^)* ® QQ"^] \Q) = \Q)- Unvectorising, we have Qa + aQ = Q, 
and clearly a = ^1 + Q±, where we can add an arbitrary operator Q± in the kernel of Q. The 
lemma follows easily when we recall that P is in the support of Q. □ 

Proof (of Lemma 37) First we bound 

{BA-y\l) = {BA~^y~^BA~^ |1) < ^{BA~y-^B\l) (99) 
using Lemma 35. To show {BA^^Y^^B |1) = i^^B |1) by induction, it suffices to show 

BA-'^B\1) = ]^B\1) , (100) 
which is then iterated t — \ times. But this is already established by Lemma 36. □ 
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6.1.2 A Weaker Conjecture 

Conjecture 1 weakens the bound we proved in the commutative case by a polynomial factor. In 
fact, we will see that we can even cope with an exponential factor, and still prove a constructive 
QLLL with polynomial run-time, albeit at the expense of having to strengthen the Lovasz 
conditions in the non-commutative case. (We recover the original Lovasz conditions if the 
projectors commute.) 

Conjecture 2 Let {Hi} be an arbitrary set of projectors, let t be a fixed tree with vertices 
labelled by these projectors, and L the log produced by running Algorithm 2 with these projectors. 
We conjecture that the probability that r occurs in L is at most 

Pr(r)<-^nP^[nW]' (101) 

where |r| is the size of t (total number of vertices). 

The motivation for this conjecture comes from the following result, which once again proves 
the conjecture for the simplest case of trees containing a single vertex. 

Proposition 38 Let {Hi} be a set of in projectors on (C^, and define 6 to be the spectral gap 
of the associated Hamiltonian 

H=-y2Ui. (102) 

i 

Consider the following iterated measurement process, starting from the maximally mixed state. 
In each step, a projector Ilj is chosen independently uniformly at random, and the two- outcome 
measurement {Ilj, 1 — Ilj} is performed. This is repeated until a Ilj outcome is obtained, at 
which point the process halts. 

Let Pa denote the final state of the system given that it halted on outcome Ha, and pa the 
probability that this occurs. Then the (unnormalised) density matrix corresponding to the process 
halting on outcome Ha satisfies 

Xa = PaPa < -^Ha/d, (103a) 

mo 

hence 

Pa = Tr{Xa)<—^^^. (103b) 
mo a 

For commuting projectors, we always have 5 > 1/m (and equality holds unless every state 
outside the kernel of H violates more than one projector), and we recover Lemma 16. For 
non-commuting projectors, 5 can range between and 1, but we would expect the more difficult 
cases to be when 6 < 1/m, in which case Proposition 38 is a weaker statement than that already 
proven in Proposition 32. 

Nonetheless, Proposition 38 is substantially easier to prove than Proposition 32, which 
could suggest that Conjecture 2 is the "correct" non-commutative generalisation of the bound 
in Lemma 8, and Proposition 32 is a red-herring that only applies to the very special case of 
single- vertex trees. 

Proof Define for the linear map 

-^(P) — ^ Ei(l -ni)p(l -Hj), and let M* denote the t-Md 
composition of Ai. Then the unnormalised density matrix Xa is given by 

Xa = -Haf2l Yl -(i-n.J---(5^-(i-n,jl(i-njjy..(i-njj|n„ (io4a) 

t=0 \ii,...,it \ h / / 

= ^Ha M\l/d)j Ha. (104b) 
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Let G be the projector onto the ground state subspace (kernel) of H, and E = be 
the projector onto the subspace of excited states (the support, or coimage, of H). Note that 
Vi : IljG = Gllj = and HiE = Elii = Ilj. For any scalars q, /3, 



MiaG + f3E) = — V(l - Ui)(aG + ^E)(l - Ui) = aG + (3— V(S - Hj) (105a) 
m ^-^ m ^-^ 

i=l i=l 

= aG + pE-H <aG + p{l-5)E. (105b) 
Applying this operator inequality t times starting from a = (3 = 1 gives 

M\l)<G+{l-5)*E. (106) 

Using this in Eq. (104b) leads to 

oo 1 °° 1 1 

t=0 i=0 

which proves the proposition. □ 
6.2 Expected number of violations 

From the conjectures, we can derive bounds on the expected number of violations seen by 
Algorithm 2. 

Definition 39 (e-strengthened Lovasz conditions) Let ni,n2,...,nm he projectors that 
act on arbitrary subsets of n qudits. We say that the set of projectors {Hi} satisfies the e- 
strengthened Lovasz conditions if there exist values < xi,X2, ■ ■ ■ ,Xm < 1 such that 

R{Ui) <{l-e)xi-Yl{l-Xj). (108) 

n^ercn,) 

Theorem 40 Let {Hi} be a set of m projectors satisfying the e-strengthened Lovasz conditions 
of Definition 39 for some e > 0. // Conjecture 1 is true, then there exists a constant C (which 
depends on e) such that the expected number of violations seen by Algorithm 2 is bounded by 

m 

Xi 



E(total number of violations) < C ^ - — - — . (109) 

. . -L Xi 



i=l 

Proof The proof is very similar to that of Theorem 12 in Section 3.2 for the commutative case. 
Let Na be the number of times a given projector XIq appears in Algorithm 2's log, which is the 
same as the number of times Ha is measured to be violated. Let Ta denote all proper witness 
trees whose root vertex is labelled by Ila. Then, using Conjecture 1, we have 

E{Na) = ^ Pr(T appears in the log) < ^ poly(|r|) JJ Pr[n(7;)]. (110) 

The probability Pr[nj] of a projector being violated on a random state is just given by its relative 
dimension Pr(nj) = R(ni). Since by assumption the projectors satisfy the Lovasz conditions of 
Definition 39, we have 

ma) < E poiy(i^i) n ^(n^) ^ E p°iy(i^i) 11 - ^)^^ (i^^^) 

reTa HiGr reTa UiGr 

^ E P°ly(l^l)(l - Pr(r), (111b) 

T&Ta 
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where Pr(ra) is the probabihty that the Galton- Watson process defined in Lemma 11 generates 
tree Tq. The inequality in Eq. (111b) follows from Lemma 11. 

The summand in Eq. (111b) is just the expectation of a functional f{x) = poly(a;)(l — e)^ 
over the total progeny \t\ of the Galton- Watson process. For our purposes, it sufficies to observe 
that this functional is always bounded. Let C := msiXx>o f (x) ■ Then 

ma) < E CPr(r) < C-^, (112) 

the final inequality coming from the fact that the Galton- Watson process either produces a tree 
in Ta, or continues indefinitely. The theorem follows from summing over all projectors. □ 

In fact, to prove Theorem 40, all we require is that the expectation of the functional poly(|r|) 
of the total progeny r of the Galton- Watson multi-type branching process be bounded. It is 
therefore likely that the 1 — e factor can be removed by a more careful analysis of the distribution 
of total progeny for the branching process. 

Theorem 41 Let {Hi} be a set ofm projectors with spectral gap 5, which satisfy the e- strengthened 
Lovdsz conditions for e = 1 — m5. If Conjecture 2 is true, then the expected number of violations 
seen by Algorithm 2 is bounded by 

m 

E(total number of violations) < > — . (113) 

1-Xi 

1=1 

Proof As in Theorem 40, 



^ ^ (mS)\ 



E{Na) = y Pr(T appears in the log) < V , \, TT Pr[n(?;)], (114) 

reTa 

this time using Conjecture 2. Thus 

nN,) s ^ E P.(.) ^ ^ E , (115) 



and the theorem follows by summing over all projectors. □ 



6.3 Converging to a solution 

In the non-commutative setting, there is an ambiguity in what it means for an algorithm to 
construct a state satisfying the requirements of the QLLL of Theorem 3. The non-constructive 
QLLL asserts the existence of a state that does not violate any of the projectors. We could 
demand that the algorithm converges to a state whose probability of violating any projector is 
at most e; we will call this "weak convergence", or "convergence in energy". Alternatively, we 
could demand that the algorithm converges to a state that is e-close (in some suitable distance 
measure, say fidelity) to the subspace of states that do not violate any projector; we will call this 
"strong convergence" , or "convergence in fidelity" . In the classical and in the commutative cases, 
these are equivalent, and there is a single, unambiguous definition of convergence. However, non- 
commutativity of the projectors means that these two notions of convergence are in general no 
longer equivalent in the quantum case. 

Definition 42 (Strong convergence, or convergence in fidelity) We say that an algorithm 
converges in the strong sense to a state fulfilling the QLLL if, given any e > 0, there exists a 
t such that the state p produced by running the algorithm for time t satisfies Tr[PQp] > 1 — e, 
where Pq is the projector onto the subspace of states defined by span{|V') : Villj \^p) = 0}; i.e. 
the state p is e-close in fidelity to the subspace of states fulfilling the QLLL requirements. 
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Definition 43 (Weak convergence, or convergence in energy) We say that an algorithm 
converges in the weak sense to a state fulfilling the QLLL if, given any e > 0, there exists a t 
such that the state p produced by running the algorithm for time t satisfies \/i Trpjp] < e. 

As our terminology suggests, strong convergence implies weak convergence, but in general 
the converse does not hold. 

Lemma 44 Strong convergence is strictly stronger than weak convergence. 

Proof It is trivial to see that strong convergence implies weak convergence: if p is such that 
Tr[Po/o] > 1 - e/m, then 

^Tr[n,p] = 5]Tr[n,(l - Po)p] < ^^^[(1 " ^o)p] < e (116) 

i i i 

(the first equality following from the fact that IljPo = by definition, the middle inequality 
from the operator inequality Ilj < 1). 

To see that weak convergence is strictly weaker, consider the example of two almost-identical 
projectors acting on the same set of qudits: XIq = |V')(^I ^iid Hi = with | (V'lV'') P = 

1-5. Let 

1^,.) ^ (i-IWjj)!^') 

yl — 6 

be the state in the span of , orthogonal to |^). For p = we have 

Tr[no^] = |(V'|^^)P = 0, (118) 
Trpip] = I (W^) |2 = I (V^'Kl - |^)(V|)|^') ? = (119) 

whereas Tr[Pop] = 0. Therefore, letting 5 tend to 0, an algorithm which converges to the state 
p has converged to arbitrarily high precision in the weak sense, yet it is as far as possible from 
convergence in the strong sense. □ 

However, weak convergence does imply strong convergence if we are prepared to pay a price 
in the convergence time. That price is a dependence on the spectral gap of Definition 30. 

Lemma 45 // an algorithm converges weakly in time 0{f{l/e)), then it converges strongly in 
time 0{f{\/m6e)). 

Proof If we run the algorithm for time 0(/(l/m(5e)) then, by Definition 43 of weak convergence, 
the state p produced by the algorithm must satisfy \/i : Tr[nj/3] < mSe. Decomposing the state 
p as 

p = PopPo + Pop{l - Po) + (1 - Po)pPo + (1 - ^^0)^(1 - Po), (120) 
and noting that IljPo = by definition, we have 

Se > -Y^TviUip] 

i 

= ^ E(Tr[n,Popi='o] + Tr[n,Pop(l - ^'o)] (121a) 

i 

+ T:[n,(l - Po)pPo] + Tr[ni(l - Po)/o(l - Po)]] 
= - VTr[ni(l-Po)/o(l-^o)]. (121b) 



m 
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But, by Definition 30 of the spectral gap 6, 



m 



1 



E 



Tr[n,(l-Po)p(l-Po)] 
Ti:[(l - Po)p] 



(122) 



tfius from Eqs. (121) and (122) we obtain Tr[(l — Po)p] < or 



TV[Pop] > 1 - e, 



(123) 



proving strong convergence in time 0{f{l/m5e)) as claimed. 



□ 



Which is the "correct" notion of convergence depends on what one is trying to achieve. From 
a physics perspective, an algorithm for finding a state fulfilling the Lovasz conditions can be 
interpreted as an algorithm for finding the ground state of the Hamiltonian 



If we are interested in cooling a system to the ground state, producing a state with low energy is 
sufficient and weak convergence is the appropriate notion, since the parameter e in Definition 43 
upper-bounds the energy of the state with respect to the Hamilton H. On the other hand, if 
we are interested in ground state properties, we want the algorithm to produce a state close in 
fidelity to the true ground state, requiring strong convergence. 

If the spectral gap scales inverse-polynomially, then the run-time remains polynomial even 
for strong convergence. For some important cases of the QLLL, the gap does indeed scale 
inverse-polynomially, and the run-time is provably polynomial even for strong convergence. (For 
example, in the commutative case the gap is constant and we have already seen that an efficient 
algorithm always exists.) On the other hand, there are certainly cases for which the spectral gap 
is exponentially small, and in that case the run-time we require to ensure strong convergence 
will be exponentially large. However, an inverse-linear dependence on the gap may be the best 
one could hope to achieve, at least for algorithms that work by measuring the projectors, as it 
takes expected time 1/(5 to see even one violation if the system is in the lowesr excited state (a 
state necessarily orthogonal to the ground-state subspace). 

From a complexity theoretic perspective, weak convergence is the more natural notion, as it 
corresponds to the canonical FQMA-complete problem of low-energy state preparation [Janzing 
et al., 2003]. (FQMA is the quantum analogue of FNP, the functional version of NP; whereas 
an NP or QMA problem asks whether there exists an input producing a "yes" answer, an FNP 
or FQMA problem requires such an input to be prepared.) Similarly, the fc-QSAT problem, as 
defined and shown to be QMAo-complete by Bravyi [2006], promises that a given fc-local Hamil- 
tonian either has a ground state (minimum eigenvalue eigenstate) with exactly zero energy, or 
has a ground state with energy larger than a (where a is inverse polynomial in the problem size) . 
The /c-QSAT problem is then to decide whether a zero-energy state exists. The corresponding 
FQMA problem is to produce a state that witnesses the existence of a zero-energy state. Setting 
£ < a and running an algorithm until it weakly converges to precision e produces precisely such 
a witness. 

On the other hand, formulating strong convergence as a well-defined complexity-theoretic 
concept requires a little care. For all the standard complexity classes, the projectors defining 
a problem instance must be specified classically. Thus if the problem size is to be well-defined, 
the projectors can only be specified to finite precision. However, there exist instances which are 
arbitrarily close to each other (the matrix elements of the two sets of projectors are arbitrarily 
close), yet a state satisfying the strong convergence requirements for one instance is maximally 
far from satisfying those conditions for the other instance. (The two-projector example consid- 
ered in Lemma 44 provides an instance of this. By making 5 arbitrarily small, the example can 




(124) 
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be made arbitrarily close to a QLLL problem consisting of two identical projectors Ho. Yet the 
state IV'"'") J which satisfies the strong convergence requirements when both projectors are Ho, is 
as far as possible for satisfying those requirements for the instance given in Lemma 44. In order 
for any algorithm to determine that IV""*") is not a valid solution, the projectors would have to 
be specified to arbitrarily high precision.) 

To make the strong convergence case into a well-defined complexity-theoretic problem, we 
must either formulate it as a weak-membership problem [Grotschel et al., 1988], with a parameter 
6 specifying the precision to which the answer must be given, or equivalently we can formulate 
it as a promise problem, with a promise on the spectral gap (cf. Cubitt et al. [2009]). Either 
way, this means that the problem instance itself sets a lower-bound on the spectral gap. But 
Lemma 45 shows that an efficient algorithm for weak convergence already implies an efficient 
algorithm for the complexity-theoretic formulation of strong convergence in this case. So in the 
most reasonable complexity-theoretic formulation, strong convergence and weak convergence 
are equivalent. 

6.4 An efficient quantum algorithm 

In Section 3.3, we used the bound from Theorem 12 together with Theorem 13 to prove that 
Algorithm 4 converges efficiently to the desired state. But Theorem 13 applies to any set of 
projectors; it does not depend on commutativity. Therefore, the analogous non-commutative 
bounds in Theorems 40 and 41 (which depend on Conjectures 1 and 2) together with Theorem 13 
imply that Algorithm 4 converges efficiently (in the weak sense) in the non-commutative case. 
Lemma 45 extends this to strong convergence, proving the following constructive results in the 
general, non-commutative case of the Quantum Lovasz Local Lemma. (The existence parts of 
the following theorems are true independent of the conjectures, by Theorem 3.) 

Theorem 46 (Constructive QLLL) Let {Hi} be a set of m projectors projectors acting on 
subsets of n qudits, with spectral gap 5 (Definition 30). //{Ilj} satisfy the e- strengthened Lovasz 
conditions (Definition 39) for any e > 0, then there exists a joint state p of the qudits such that 
Vi : Tr[n,p] = 0. 

Moreover, if Conjecture 1 holds, there is a quantum algorithm that constructs a state p' such 
that Vi Tr[njp'] < e, in time 



where \[i]\ is the number of qudits on which the projector Ilj acts non-trivially, thus p' has 
probability at most e of violating any of the constraints defined by the Ilj. 

The same algorithm constructs a state p" such that Tr[Pop"] > 1— £; where Pq is the projector 
onto the subspace spanjlV') : Villj {ip) = 0}, in time 



thus p" is £-close in fidelity to the subspace of states satisfying Villj \ip) = 0. 

It may well be possible to remove the 1 — e strengthening of the Lovasz conditions, by a more 
refined analysis of the branching process in Theorem 40. (See discussion after the proof of that 
theorem.) 

Theorem 47 (Constructive QLLL, strengthened Lovasz conditions) 

Let {Ilj} be a set ofm projectors projectors acting on subsets ofn qudits, with spectral gap 6 (Def- 
inition 30). If {Hi} satisfy the e- strengthened Lovasz conditions (Definition 39) for e = 1 — mS, 
then there exists a joint state p of the qudits such that Vi Tr[nj/3] = 0. 
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Moreover, if Conjecture 2 holds, there is a quantum algorithm that constructs a state p' such 
that Ti[Ilip'] < e, in time 



where \[i]\ is the number of qudits on which the projector Hi acts non-trivially, thus p' has 
probability at most e of violating any of the constraints defined by the Hi. 

The same algorithm constructs a state p" such that Ti[Pqp"] > 1 — e, where Pq is the projector 
onto the subspace span{|^) : Villj |^) = 0}, in time 



thus p" is e-close in fidelity to the subspace of states satisfying Villj = 0. 

Note that all the arguments of Section 5 go through for the non-commutative case, so that 
Theorems 46 and 47 lead directly to CP map convergence results for more general classes of 
maps. This generalises Theorem 28 and Corollary 29 to non-commutative local Hamiltonians, 
and their associated dissipative state engineering CP maps. 

7 Conclusions 

We have proven that a simple quantum algorithm (Algorithm 4) efficiently constructs a state 
satisfying the requirements of the Quantum Lovasz Local Lemma (Theorem 3) of Ambainis et al. 
[2009] in the setting of commuting constraints. Not only does this give an efficient algorithm 
for constructing the quantum state whose existence is guaranteed by the commutative QLLL. 
In fact, since we do not assume the QLLL in order to prove that the algorithm finds the state, 
this also gives an independent, constructive proof of the commutative QLLL itself. 

Until Beck [1991] provided the first algorithm, it was not a priori clear whether the combi- 
natorial objects whose existence was guaranteed by the classical LLL could also be constructed 
efficiently. In the quantum case, it was perhaps even less obvious whether the states guaranteed 
to exist by the QLLL could be prepared efficiently, as those states can have a highly complex 
entanglement structure. Our result gives a new quantum algorithm for efficiently constructing 
these complex states in the commutative case. Or, in physics terms, our result provides a new 
method of cooling to the ground state of certain many-body Hamiltonians with commuting local 
terms. 

In the non-commutative setting, we can only prove the constructive QLLL modulo a tech- 
nical conjecture that the probability of witness trees is at most polynomially weaker than the 
bound we have proven in the commuting case (Conjecture 1). If we impose stronger Lovasz 
conditions (which reduce to the usual conditions in the commutative case), we can prove a 
constructive QLLL using a weaker conjecture (Conjecture 2), which weakens the commuta- 
tive bound by an exponential factor in the non-commutative setting. We have given a simple 
counter-example showing that the bound from the commutative case does not hold in general. 
The violation of the commuting bound is directly attributable to the measurement-disturbance 
effect of non-commuting quantum measurements, which means that even "satisfied" outcomes 
can disturb the state in an undesirable way. 

We can prove both our conjectures in the simplest case of single- vertex trees. This is already 
non-trivial, as it shows that the "satisfied" measurements that can cause so much trouble in 
the non-commutative case can effectively be ignored up to the first violation. However, as our 
counter-example demonstrates, the case of multiple violations is substantially different, and 
proving the conjectures in general is likely to require new ideas and techniques. Our conjectures 
are supported by numerical evidence for some small multi- vertex trees, though the numerics 
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we have done are very limited. However, it is very likely that the conjectures must hold if 
the natural quantum generalisation of Moser's algorithm (Algorithm 2) is to work. Since the 
probability of witness trees occurring in the algorithm's log is directly related to the expected 
number of violations seen by the algorithm (Theorems 12, 40 and 41), if the witness tree 
probability does not shrink fast enough with the size of the tree, the expected number of 
violations will be unbounded. Thus disproving our conjectures would strongly suggest that an 
entirely different approach to the one pioneered by Moser in the classical setting is required in 
the non-commutative quantum setting. (Or that there is no efficient constructive version of the 
general QLLL.) 

In the classical case, Moser and Tardos [2010] impose a slight restriction on the events that 
feature in the LLL, requiring that they be determined by an underlying set of random variables. 
In the quantum case, we imposed the analogous restriction, requiring that the subspaces in the 
QLLL are defined on an underlying set of qudits. Kolipaka and Szegedy [2011] have generalised 
the original Moser and Tardos [2010] results, and removed this restriction in the classical case. 
Their results also prove a constructive algorithm right up to the Shearer bound, the tightest 
possible version of the Lovasz conditions. Finally, they also generalise an earlier result of 
Haeupler et al. [2010], showing that the Moser algorithm is efficient in the number of variables 
even if the number of events is super-polynomial. It seems likely that these results can be 
generalised to the quantum setting, at least in the commutative case. In particular, this would 
remove the assumption of an underlying tensor product structure, proving a constructive version 
of the QLLL for general subspaces, matching the formulation of Ambainis et al. [2009]. 

Although not expressed in this way in the published versions of the papers, Moser pointed 
out that his constructive proof [Moser, 2009] of the symmetric Lovasz Local Lemma (Corollary 2) 
can be formulated as an elegant compression argument [Moser, 2010, Tao, 2009]: the sequence of 
random bits used by the algorithm can be recovered perfectly from the data in the execution log 
and final output, but if the length of the log grows indefinitely, then this data can be compressed 
into fewer bits than the entropy. The strong converse of Shannon's noiseless coding theorem 
(see e.g. Cover and Thomas [2006]) imposes an exponential suppression of the probability of 
compressing below the entropy, implying a linear bound on the expected length of the log, hence 
also the run-time. This compression argument can be extended to the general Lovasz Local 
Lemma, and it is reasonably clear [Moser, 2010] that with a little effort it could even give tight 
constants in the general LLL (Theorem 1). Indeed, the witness trees and coupling argument of 
Moser and Tardos [2010] essentially contains an implicit compression argument, but elegantly 
side-steps the necessity of designing an explicit compression scheme for the log. Our witness 
tree and quantum coupling argument again contains an implicit compression argument. But 
the compression argument can also be generalised to the quantum case explicitly, either making 
use of the strong converse of the Schumacher noiseless coding theorem [Ogawa and Nagaoka, 
1999, Winter, 1999], or using a more general information-theoretic analysis [Sattath and Arad]. 
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